Dataset statistics
| Number of variables | 36 |
|---|---|
| Number of observations | 413412 |
| Missing cells | 2449654 |
| Missing cells (%) | 16.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 113.5 MiB |
| Average record size in memory | 288.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 25 |
CMPLNT_FR_DT has a high cardinality: 1792 distinct values | High cardinality |
CMPLNT_FR_TM has a high cardinality: 1440 distinct values | High cardinality |
CMPLNT_TO_DT has a high cardinality: 1264 distinct values | High cardinality |
CMPLNT_TO_TM has a high cardinality: 1440 distinct values | High cardinality |
OFNS_DESC has a high cardinality: 59 distinct values | High cardinality |
PARKS_NM has a high cardinality: 508 distinct values | High cardinality |
PD_DESC has a high cardinality: 336 distinct values | High cardinality |
PREM_TYP_DESC has a high cardinality: 74 distinct values | High cardinality |
RPT_DT has a high cardinality: 366 distinct values | High cardinality |
STATION_NAME has a high cardinality: 362 distinct values | High cardinality |
Lat_Lon has a high cardinality: 67403 distinct values | High cardinality |
New Georeferenced Column has a high cardinality: 67403 distinct values | High cardinality |
X_COORD_CD is highly correlated with Longitude | High correlation |
Y_COORD_CD is highly correlated with Latitude | High correlation |
Latitude is highly correlated with Y_COORD_CD | High correlation |
Longitude is highly correlated with X_COORD_CD | High correlation |
LAW_CAT_CD is highly correlated with OFNS_DESC | High correlation |
BORO_NM is highly correlated with HADEVELOPT and 1 other fields | High correlation |
HADEVELOPT is highly correlated with BORO_NM and 1 other fields | High correlation |
OFNS_DESC is highly correlated with LAW_CAT_CD | High correlation |
PATROL_BORO is highly correlated with BORO_NM and 1 other fields | High correlation |
CMPLNT_TO_DT has 39104 (9.5%) missing values | Missing |
CMPLNT_TO_TM has 38979 (9.4%) missing values | Missing |
HADEVELOPT has 411842 (99.6%) missing values | Missing |
HOUSING_PSA has 382445 (92.5%) missing values | Missing |
LOC_OF_OCCUR_DESC has 66086 (16.0%) missing values | Missing |
PARKS_NM has 410736 (99.4%) missing values | Missing |
STATION_NAME has 406179 (98.3%) missing values | Missing |
SUSP_AGE_GROUP has 94862 (22.9%) missing values | Missing |
SUSP_RACE has 94862 (22.9%) missing values | Missing |
SUSP_SEX has 94862 (22.9%) missing values | Missing |
TRANSIT_DISTRICT has 406179 (98.3%) missing values | Missing |
CMPLNT_NUM has unique values | Unique |
JURISDICTION_CODE has 371665 (89.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-03-06 21:34:46.900655 |
|---|---|
| Analysis finished | 2021-03-06 21:39:34.857821 |
| Duration | 4 minutes and 47.96 seconds |
| Software version | pandas-profiling v2.10.1 |
| Download configuration | config.yaml |
| Distinct | 413412 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 549358625.6 |
|---|---|
| Minimum | 100001361 |
| Maximum | 999998911 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 100001361 |
|---|---|
| 5-th percentile | 144865971.2 |
| Q1 | 323777360.8 |
| median | 549621616 |
| Q3 | 774108179.5 |
| 95-th percentile | 954860935.4 |
| Maximum | 999998911 |
| Range | 899997550 |
| Interquartile range (IQR) | 450330818.8 |
Descriptive statistics
| Standard deviation | 259732435.9 |
|---|---|
| Coefficient of variation (CV) | 0.4727921321 |
| Kurtosis | -1.199660895 |
| Mean | 549358625.6 |
| Median Absolute Deviation (MAD) | 225219565 |
| Skewness | 0.002008709704 |
| Sum | 2.271114481 × 1014 |
| Variance | 6.746093825 × 1016 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 635441150 | 1 | < 0.1% |
| 421827135 | 1 | < 0.1% |
| 588528170 | 1 | < 0.1% |
| 652489259 | 1 | < 0.1% |
| 307497516 | 1 | < 0.1% |
| 483715632 | 1 | < 0.1% |
| 362078769 | 1 | < 0.1% |
| 253020724 | 1 | < 0.1% |
| 277127737 | 1 | < 0.1% |
| 557101627 | 1 | < 0.1% |
| Other values (413402) | 413402 |
| Value | Count | Frequency (%) |
| 100001361 | 1 | |
| 100003492 | 1 | |
| 100005651 | 1 | |
| 100008265 | 1 | |
| 100010196 | 1 |
| Value | Count | Frequency (%) |
| 999998911 | 1 | |
| 999997084 | 1 | |
| 999993654 | 1 | |
| 999993350 | 1 | |
| 999987110 | 1 |
ADDR_PCT_CD
Real number (ℝ≥0)
| Distinct | 77 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 63.9899761 |
|---|---|
| Minimum | 1 |
| Maximum | 123 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 40 |
| median | 66 |
| Q3 | 101 |
| 95-th percentile | 115 |
| Maximum | 123 |
| Range | 122 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 34.47196848 |
|---|---|
| Coefficient of variation (CV) | 0.5387088819 |
| Kurtosis | -1.155445096 |
| Mean | 63.9899761 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | 0.0356859667 |
| Sum | 26454224 |
| Variance | 1188.316611 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 75 | 12814 | 3.1% |
| 40 | 10630 | 2.6% |
| 44 | 9579 | 2.3% |
| 43 | 9369 | 2.3% |
| 47 | 8966 | 2.2% |
| 114 | 8850 | 2.1% |
| 46 | 8775 | 2.1% |
| 52 | 8288 | 2.0% |
| 42 | 7860 | 1.9% |
| 73 | 7689 | 1.9% |
| Other values (67) | 320592 |
| Value | Count | Frequency (%) |
| 1 | 4960 | |
| 5 | 3320 | |
| 6 | 4613 | |
| 7 | 3860 | |
| 9 | 4222 |
| Value | Count | Frequency (%) |
| 123 | 2024 | 0.5% |
| 122 | 4235 | |
| 121 | 4899 | |
| 120 | 5845 | |
| 115 | 7197 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 485 |
| Missing (%) | 0.1% |
| Memory size | 3.2 MiB |
| BROOKLYN | |
|---|---|
| MANHATTAN | |
| BRONX | |
| QUEENS | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.353670261 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3036529 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BRONX |
|---|---|
| 2nd row | BRONX |
| 3rd row | BRONX |
| 4th row | BRONX |
| 5th row | QUEENS |
| Value | Count | Frequency (%) |
| BROOKLYN | 119208 | |
| MANHATTAN | 97365 | |
| BRONX | 90446 | |
| QUEENS | 88922 | |
| STATEN ISLAND | 16986 | 4.1% |
| (Missing) | 485 | 0.1% |
| Value | Count | Frequency (%) |
| brooklyn | 119208 | |
| manhattan | 97365 | |
| bronx | 90446 | |
| queens | 88922 | |
| staten | 16986 | 4.0% |
| island | 16986 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 527278 | |
| O | 328862 | |
| A | 326067 | |
| T | 228702 | 7.5% |
| B | 209654 | 6.9% |
| R | 209654 | 6.9% |
| E | 194830 | 6.4% |
| L | 136194 | 4.5% |
| S | 122894 | 4.0% |
| K | 119208 | 3.9% |
| Other values (9) | 633186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3019543 | |
| Space Separator | 16986 | 0.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| N | 527278 | |
| O | 328862 | |
| A | 326067 | |
| T | 228702 | 7.6% |
| B | 209654 | 6.9% |
| R | 209654 | 6.9% |
| E | 194830 | 6.5% |
| L | 136194 | 4.5% |
| S | 122894 | 4.1% |
| K | 119208 | 3.9% |
| Other values (8) | 616200 |
| Value | Count | Frequency (%) |
| 16986 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3019543 | |
| Common | 16986 | 0.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| N | 527278 | |
| O | 328862 | |
| A | 326067 | |
| T | 228702 | 7.6% |
| B | 209654 | 6.9% |
| R | 209654 | 6.9% |
| E | 194830 | 6.5% |
| L | 136194 | 4.5% |
| S | 122894 | 4.1% |
| K | 119208 | 3.9% |
| Other values (8) | 616200 |
| Value | Count | Frequency (%) |
| 16986 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3036529 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 527278 | |
| O | 328862 | |
| A | 326067 | |
| T | 228702 | 7.5% |
| B | 209654 | 6.9% |
| R | 209654 | 6.9% |
| E | 194830 | 6.4% |
| L | 136194 | 4.5% |
| S | 122894 | 4.0% |
| K | 119208 | 3.9% |
| Other values (9) | 633186 |
| Distinct | 1792 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| 06/01/2020 | 1857 |
|---|---|
| 01/01/2020 | 1733 |
| 01/15/2020 | 1439 |
| 06/02/2020 | 1392 |
| 08/01/2020 | 1388 |
| Other values (1787) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 4134120 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 797 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 12/23/2020 |
|---|---|
| 2nd row | 12/21/2020 |
| 3rd row | 11/22/2020 |
| 4th row | 11/22/2020 |
| 5th row | 11/21/2020 |
| Value | Count | Frequency (%) |
| 06/01/2020 | 1857 | 0.4% |
| 01/01/2020 | 1733 | 0.4% |
| 01/15/2020 | 1439 | 0.3% |
| 06/02/2020 | 1392 | 0.3% |
| 08/01/2020 | 1388 | 0.3% |
| 01/31/2020 | 1377 | 0.3% |
| 01/14/2020 | 1371 | 0.3% |
| 03/13/2020 | 1366 | 0.3% |
| 08/14/2020 | 1365 | 0.3% |
| 10/23/2020 | 1360 | 0.3% |
| Other values (1782) | 398764 |
| Value | Count | Frequency (%) |
| 06/01/2020 | 1857 | 0.4% |
| 01/01/2020 | 1733 | 0.4% |
| 01/15/2020 | 1439 | 0.3% |
| 06/02/2020 | 1392 | 0.3% |
| 08/01/2020 | 1388 | 0.3% |
| 01/31/2020 | 1377 | 0.3% |
| 01/14/2020 | 1371 | 0.3% |
| 03/13/2020 | 1366 | 0.3% |
| 08/14/2020 | 1365 | 0.3% |
| 10/23/2020 | 1360 | 0.3% |
| Other values (1782) | 398764 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1328012 | |
| 2 | 1061817 | |
| / | 826824 | |
| 1 | 376303 | 9.1% |
| 3 | 93187 | 2.3% |
| 9 | 82130 | 2.0% |
| 8 | 78604 | 1.9% |
| 7 | 76355 | 1.8% |
| 5 | 73065 | 1.8% |
| 6 | 72235 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3307296 | |
| Other Punctuation | 826824 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1328012 | |
| 2 | 1061817 | |
| 1 | 376303 | 11.4% |
| 3 | 93187 | 2.8% |
| 9 | 82130 | 2.5% |
| 8 | 78604 | 2.4% |
| 7 | 76355 | 2.3% |
| 5 | 73065 | 2.2% |
| 6 | 72235 | 2.2% |
| 4 | 65588 | 2.0% |
| Value | Count | Frequency (%) |
| / | 826824 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4134120 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1328012 | |
| 2 | 1061817 | |
| / | 826824 | |
| 1 | 376303 | 9.1% |
| 3 | 93187 | 2.3% |
| 9 | 82130 | 2.0% |
| 8 | 78604 | 1.9% |
| 7 | 76355 | 1.8% |
| 5 | 73065 | 1.8% |
| 6 | 72235 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4134120 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1328012 | |
| 2 | 1061817 | |
| / | 826824 | |
| 1 | 376303 | 9.1% |
| 3 | 93187 | 2.3% |
| 9 | 82130 | 2.0% |
| 8 | 78604 | 1.9% |
| 7 | 76355 | 1.8% |
| 5 | 73065 | 1.8% |
| 6 | 72235 | 1.7% |
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| 12:00:00 | 10567 |
|---|---|
| 15:00:00 | 8696 |
| 18:00:00 | 8654 |
| 17:00:00 | 8405 |
| 20:00:00 | 8221 |
| Other values (1435) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 3307296 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 19:50:00 |
|---|---|
| 2nd row | 01:10:00 |
| 3rd row | 22:00:00 |
| 4th row | 09:50:00 |
| 5th row | 15:38:00 |
| Value | Count | Frequency (%) |
| 12:00:00 | 10567 | 2.6% |
| 15:00:00 | 8696 | 2.1% |
| 18:00:00 | 8654 | 2.1% |
| 17:00:00 | 8405 | 2.0% |
| 20:00:00 | 8221 | 2.0% |
| 16:00:00 | 7981 | 1.9% |
| 19:00:00 | 7766 | 1.9% |
| 14:00:00 | 6986 | 1.7% |
| 21:00:00 | 6739 | 1.6% |
| 22:00:00 | 6720 | 1.6% |
| Other values (1430) | 332677 |
| Value | Count | Frequency (%) |
| 12:00:00 | 10567 | 2.6% |
| 15:00:00 | 8696 | 2.1% |
| 18:00:00 | 8654 | 2.1% |
| 17:00:00 | 8405 | 2.0% |
| 20:00:00 | 8221 | 2.0% |
| 16:00:00 | 7981 | 1.9% |
| 19:00:00 | 7766 | 1.9% |
| 14:00:00 | 6986 | 1.7% |
| 21:00:00 | 6739 | 1.6% |
| 22:00:00 | 6720 | 1.6% |
| Other values (1430) | 332677 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1428155 | |
| : | 826824 | |
| 1 | 329855 | 10.0% |
| 2 | 182711 | 5.5% |
| 5 | 139785 | 4.2% |
| 3 | 133412 | 4.0% |
| 4 | 86310 | 2.6% |
| 8 | 49910 | 1.5% |
| 9 | 47333 | 1.4% |
| 7 | 43263 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2480472 | |
| Other Punctuation | 826824 | 25.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1428155 | |
| 1 | 329855 | 13.3% |
| 2 | 182711 | 7.4% |
| 5 | 139785 | 5.6% |
| 3 | 133412 | 5.4% |
| 4 | 86310 | 3.5% |
| 8 | 49910 | 2.0% |
| 9 | 47333 | 1.9% |
| 7 | 43263 | 1.7% |
| 6 | 39738 | 1.6% |
| Value | Count | Frequency (%) |
| : | 826824 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3307296 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1428155 | |
| : | 826824 | |
| 1 | 329855 | 10.0% |
| 2 | 182711 | 5.5% |
| 5 | 139785 | 4.2% |
| 3 | 133412 | 4.0% |
| 4 | 86310 | 2.6% |
| 8 | 49910 | 1.5% |
| 9 | 47333 | 1.4% |
| 7 | 43263 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3307296 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1428155 | |
| : | 826824 | |
| 1 | 329855 | 10.0% |
| 2 | 182711 | 5.5% |
| 5 | 139785 | 4.2% |
| 3 | 133412 | 4.0% |
| 4 | 86310 | 2.6% |
| 8 | 49910 | 1.5% |
| 9 | 47333 | 1.4% |
| 7 | 43263 | 1.3% |
| Distinct | 1264 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 39104 |
| Missing (%) | 9.5% |
| Memory size | 3.2 MiB |
| 06/01/2020 | 1461 |
|---|---|
| 06/02/2020 | 1425 |
| 10/21/2020 | 1272 |
| 01/15/2020 | 1271 |
| 01/14/2020 | 1239 |
| Other values (1259) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 3743080 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 517 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 06/01/2020 |
|---|---|
| 2nd row | 12/29/2020 |
| 3rd row | 12/23/2020 |
| 4th row | 12/31/2020 |
| 5th row | 12/22/2020 |
| Value | Count | Frequency (%) |
| 06/01/2020 | 1461 | 0.4% |
| 06/02/2020 | 1425 | 0.3% |
| 10/21/2020 | 1272 | 0.3% |
| 01/15/2020 | 1271 | 0.3% |
| 01/14/2020 | 1239 | 0.3% |
| 01/31/2020 | 1238 | 0.3% |
| 01/01/2020 | 1230 | 0.3% |
| 02/03/2020 | 1227 | 0.3% |
| 09/08/2020 | 1223 | 0.3% |
| 10/06/2020 | 1212 | 0.3% |
| Other values (1254) | 361510 | |
| (Missing) | 39104 | 9.5% |
| Value | Count | Frequency (%) |
| 06/01/2020 | 1461 | 0.4% |
| 06/02/2020 | 1425 | 0.4% |
| 10/21/2020 | 1272 | 0.3% |
| 01/15/2020 | 1271 | 0.3% |
| 01/14/2020 | 1239 | 0.3% |
| 01/31/2020 | 1238 | 0.3% |
| 01/01/2020 | 1230 | 0.3% |
| 02/03/2020 | 1227 | 0.3% |
| 09/08/2020 | 1223 | 0.3% |
| 10/06/2020 | 1212 | 0.3% |
| Other values (1254) | 361510 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1202748 | |
| 2 | 966576 | |
| / | 748616 | |
| 1 | 336845 | 9.0% |
| 3 | 84175 | 2.2% |
| 9 | 72337 | 1.9% |
| 8 | 71191 | 1.9% |
| 7 | 69351 | 1.9% |
| 5 | 66162 | 1.8% |
| 6 | 65795 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2994464 | |
| Other Punctuation | 748616 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1202748 | |
| 2 | 966576 | |
| 1 | 336845 | 11.2% |
| 3 | 84175 | 2.8% |
| 9 | 72337 | 2.4% |
| 8 | 71191 | 2.4% |
| 7 | 69351 | 2.3% |
| 5 | 66162 | 2.2% |
| 6 | 65795 | 2.2% |
| 4 | 59284 | 2.0% |
| Value | Count | Frequency (%) |
| / | 748616 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3743080 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1202748 | |
| 2 | 966576 | |
| / | 748616 | |
| 1 | 336845 | 9.0% |
| 3 | 84175 | 2.2% |
| 9 | 72337 | 1.9% |
| 8 | 71191 | 1.9% |
| 7 | 69351 | 1.9% |
| 5 | 66162 | 1.8% |
| 6 | 65795 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3743080 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1202748 | |
| 2 | 966576 | |
| / | 748616 | |
| 1 | 336845 | 9.0% |
| 3 | 84175 | 2.2% |
| 9 | 72337 | 1.9% |
| 8 | 71191 | 1.9% |
| 7 | 69351 | 1.9% |
| 5 | 66162 | 1.8% |
| 6 | 65795 | 1.8% |
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 38979 |
| Missing (%) | 9.4% |
| Memory size | 3.2 MiB |
| 12:00:00 | 6091 |
|---|---|
| 15:00:00 | 5491 |
| 10:00:00 | 4977 |
| 08:00:00 | 4876 |
| 17:00:00 | 4830 |
| Other values (1435) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2995464 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 13:30:00 |
|---|---|
| 2nd row | 22:12:00 |
| 3rd row | 07:34:00 |
| 4th row | 09:00:00 |
| 5th row | 12:22:00 |
| Value | Count | Frequency (%) |
| 12:00:00 | 6091 | 1.5% |
| 15:00:00 | 5491 | 1.3% |
| 10:00:00 | 4977 | 1.2% |
| 08:00:00 | 4876 | 1.2% |
| 17:00:00 | 4830 | 1.2% |
| 16:00:00 | 4769 | 1.2% |
| 09:00:00 | 4765 | 1.2% |
| 14:00:00 | 4522 | 1.1% |
| 13:00:00 | 4477 | 1.1% |
| 18:00:00 | 4440 | 1.1% |
| Other values (1430) | 325195 | |
| (Missing) | 38979 | 9.4% |
| Value | Count | Frequency (%) |
| 12:00:00 | 6091 | 1.6% |
| 15:00:00 | 5491 | 1.5% |
| 10:00:00 | 4977 | 1.3% |
| 08:00:00 | 4876 | 1.3% |
| 17:00:00 | 4830 | 1.3% |
| 16:00:00 | 4769 | 1.3% |
| 09:00:00 | 4765 | 1.3% |
| 14:00:00 | 4522 | 1.2% |
| 13:00:00 | 4477 | 1.2% |
| 18:00:00 | 4440 | 1.2% |
| Other values (1430) | 325195 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1215010 | |
| : | 748866 | |
| 1 | 309012 | 10.3% |
| 2 | 164481 | 5.5% |
| 5 | 155010 | 5.2% |
| 3 | 130332 | 4.4% |
| 4 | 87381 | 2.9% |
| 8 | 49409 | 1.6% |
| 9 | 47345 | 1.6% |
| 7 | 45773 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2246598 | |
| Other Punctuation | 748866 | 25.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1215010 | |
| 1 | 309012 | 13.8% |
| 2 | 164481 | 7.3% |
| 5 | 155010 | 6.9% |
| 3 | 130332 | 5.8% |
| 4 | 87381 | 3.9% |
| 8 | 49409 | 2.2% |
| 9 | 47345 | 2.1% |
| 7 | 45773 | 2.0% |
| 6 | 42845 | 1.9% |
| Value | Count | Frequency (%) |
| : | 748866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2995464 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1215010 | |
| : | 748866 | |
| 1 | 309012 | 10.3% |
| 2 | 164481 | 5.5% |
| 5 | 155010 | 5.2% |
| 3 | 130332 | 4.4% |
| 4 | 87381 | 2.9% |
| 8 | 49409 | 1.6% |
| 9 | 47345 | 1.6% |
| 7 | 45773 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2995464 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1215010 | |
| : | 748866 | |
| 1 | 309012 | 10.3% |
| 2 | 164481 | 5.5% |
| 5 | 155010 | 5.2% |
| 3 | 130332 | 4.4% |
| 4 | 87381 | 2.9% |
| 8 | 49409 | 1.6% |
| 9 | 47345 | 1.6% |
| 7 | 45773 | 1.5% |
CRM_ATPT_CPTD_CD
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| COMPLETED | |
|---|---|
| ATTEMPTED | 6508 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 3720708 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | COMPLETED |
|---|---|
| 2nd row | COMPLETED |
| 3rd row | COMPLETED |
| 4th row | COMPLETED |
| 5th row | COMPLETED |
| Value | Count | Frequency (%) |
| COMPLETED | 406904 | |
| ATTEMPTED | 6508 | 1.6% |
| Value | Count | Frequency (%) |
| completed | 406904 | |
| attempted | 6508 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 826824 | |
| T | 426428 | |
| M | 413412 | |
| P | 413412 | |
| D | 413412 | |
| C | 406904 | |
| O | 406904 | |
| L | 406904 | |
| A | 6508 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3720708 |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 826824 | |
| T | 426428 | |
| M | 413412 | |
| P | 413412 | |
| D | 413412 | |
| C | 406904 | |
| O | 406904 | |
| L | 406904 | |
| A | 6508 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3720708 |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 826824 | |
| T | 426428 | |
| M | 413412 | |
| P | 413412 | |
| D | 413412 | |
| C | 406904 | |
| O | 406904 | |
| L | 406904 | |
| A | 6508 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3720708 |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 826824 | |
| T | 426428 | |
| M | 413412 | |
| P | 413412 | |
| D | 413412 | |
| C | 406904 | |
| O | 406904 | |
| L | 406904 | |
| A | 6508 | 0.2% |
| Distinct | 25 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 411842 |
| Missing (%) | 99.6% |
| Memory size | 3.2 MiB |
| INGERSOLL | |
|---|---|
| WALD | |
| MANHATTANVILLE | |
| GRANT | |
| WHITMAN | |
| Other values (20) |
Length
| Max length | 31 |
|---|---|
| Median length | 8 |
| Mean length | 8.817834395 |
| Min length | 4 |
Characters and Unicode
| Total characters | 13844 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | WHITMAN |
|---|---|
| 2nd row | NOSTRAND |
| 3rd row | WILLIAMSBURG |
| 4th row | WALD |
| 5th row | RIIS |
| Value | Count | Frequency (%) |
| INGERSOLL | 253 | 0.1% |
| WALD | 167 | < 0.1% |
| MANHATTANVILLE | 113 | < 0.1% |
| GRANT | 107 | < 0.1% |
| WHITMAN | 102 | < 0.1% |
| WILLIAMSBURG | 98 | < 0.1% |
| NOSTRAND | 96 | < 0.1% |
| MARBLE HILL | 92 | < 0.1% |
| RIIS | 81 | < 0.1% |
| SHEEPSHEAD BAY | 76 | < 0.1% |
| Other values (15) | 385 | 0.1% |
| (Missing) | 411842 |
| Value | Count | Frequency (%) |
| ingersoll | 253 | 12.6% |
| wald | 167 | 8.3% |
| riis | 124 | 6.2% |
| manhattanville | 113 | 5.6% |
| grant | 107 | 5.3% |
| whitman | 102 | 5.1% |
| williamsburg | 98 | 4.9% |
| nostrand | 96 | 4.8% |
| hill | 92 | 4.6% |
| marble | 92 | 4.6% |
| Other values (25) | 771 |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 1497 | |
| A | 1367 | 9.9% |
| I | 1291 | 9.3% |
| R | 1061 | 7.7% |
| S | 1004 | 7.3% |
| E | 991 | 7.2% |
| N | 945 | 6.8% |
| O | 841 | 6.1% |
| T | 688 | 5.0% |
| H | 617 | 4.5% |
| Other values (17) | 3542 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 13366 | |
| Space Separator | 445 | 3.2% |
| Open Punctuation | 11 | 0.1% |
| Decimal Number | 11 | 0.1% |
| Close Punctuation | 11 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| L | 1497 | |
| A | 1367 | |
| I | 1291 | |
| R | 1061 | 7.9% |
| S | 1004 | 7.5% |
| E | 991 | 7.4% |
| N | 945 | 7.1% |
| O | 841 | 6.3% |
| T | 688 | 5.1% |
| H | 617 | 4.6% |
| Other values (13) | 3064 |
| Value | Count | Frequency (%) |
| 445 |
| Value | Count | Frequency (%) |
| ( | 11 |
| Value | Count | Frequency (%) |
| 5 | 11 |
| Value | Count | Frequency (%) |
| ) | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13366 | |
| Common | 478 | 3.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| L | 1497 | |
| A | 1367 | |
| I | 1291 | |
| R | 1061 | 7.9% |
| S | 1004 | 7.5% |
| E | 991 | 7.4% |
| N | 945 | 7.1% |
| O | 841 | 6.3% |
| T | 688 | 5.1% |
| H | 617 | 4.6% |
| Other values (13) | 3064 |
| Value | Count | Frequency (%) |
| 445 | ||
| ( | 11 | 2.3% |
| 5 | 11 | 2.3% |
| ) | 11 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13844 |
Most frequent character per block
| Value | Count | Frequency (%) |
| L | 1497 | |
| A | 1367 | 9.9% |
| I | 1291 | 9.3% |
| R | 1061 | 7.7% |
| S | 1004 | 7.3% |
| E | 991 | 7.2% |
| N | 945 | 6.8% |
| O | 841 | 6.1% |
| T | 688 | 5.0% |
| H | 617 | 4.5% |
| Other values (17) | 3542 |
| Distinct | 339 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 382445 |
| Missing (%) | 92.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7194.91407 |
|---|---|
| Minimum | 218 |
| Maximum | 71750 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 218 |
|---|---|
| 5-th percentile | 258 |
| Q1 | 489 |
| median | 720 |
| Q3 | 1269 |
| 95-th percentile | 44066 |
| Maximum | 71750 |
| Range | 71532 |
| Interquartile range (IQR) | 780 |
Descriptive statistics
| Standard deviation | 14347.62114 |
|---|---|
| Coefficient of variation (CV) | 1.994133774 |
| Kurtosis | 2.563301578 |
| Mean | 7194.91407 |
| Median Absolute Deviation (MAD) | 267 |
| Skewness | 2.009575782 |
| Sum | 222804904 |
| Variance | 205854232.5 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 670 | 474 | 0.1% |
| 887 | 411 | 0.1% |
| 720 | 405 | 0.1% |
| 1233 | 368 | 0.1% |
| 544 | 364 | 0.1% |
| 4552 | 363 | 0.1% |
| 1251 | 351 | 0.1% |
| 609 | 345 | 0.1% |
| 590 | 342 | 0.1% |
| 527 | 328 | 0.1% |
| Other values (329) | 27216 | 6.6% |
| (Missing) | 382445 |
| Value | Count | Frequency (%) |
| 218 | 259 | |
| 227 | 301 | |
| 235 | 108 | < 0.1% |
| 238 | 49 | < 0.1% |
| 240 | 30 | < 0.1% |
| Value | Count | Frequency (%) |
| 71750 | 3 | < 0.1% |
| 70679 | 1 | < 0.1% |
| 66871 | 2 | < 0.1% |
| 66563 | 124 | |
| 64443 | 5 | < 0.1% |
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 463 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.643767148 |
|---|---|
| Minimum | 0 |
| Maximum | 97 |
| Zeros | 371665 |
| Zeros (%) | 89.9% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 97 |
| Range | 97 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 6.574193338 |
|---|---|
| Coefficient of variation (CV) | 10.21206714 |
| Kurtosis | 198.1779824 |
| Mean | 0.643767148 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 14.04958619 |
| Sum | 265843 |
| Variance | 43.22001804 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 371665 | |
| 2 | 30589 | 7.4% |
| 1 | 7217 | 1.7% |
| 97 | 1550 | 0.4% |
| 3 | 1040 | 0.3% |
| 88 | 251 | 0.1% |
| 72 | 213 | 0.1% |
| 14 | 150 | < 0.1% |
| 4 | 87 | < 0.1% |
| 11 | 79 | < 0.1% |
| Other values (8) | 108 | < 0.1% |
| (Missing) | 463 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 371665 | |
| 1 | 7217 | 1.7% |
| 2 | 30589 | 7.4% |
| 3 | 1040 | 0.3% |
| 4 | 87 | < 0.1% |
| Value | Count | Frequency (%) |
| 97 | 1550 | |
| 88 | 251 | 0.1% |
| 87 | 21 | < 0.1% |
| 85 | 5 | < 0.1% |
| 72 | 213 | 0.1% |
JURIS_DESC
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| N.Y. POLICE DEPT | |
|---|---|
| N.Y. HOUSING POLICE | 30660 |
| N.Y. TRANSIT POLICE | 7223 |
| OTHER | 1551 |
| PORT AUTHORITY | 1040 |
| Other values (13) | 888 |
Length
| Max length | 28 |
|---|---|
| Median length | 16 |
| Mean length | 16.22790824 |
| Min length | 5 |
Characters and Unicode
| Total characters | 6708812 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N.Y. POLICE DEPT |
|---|---|
| 2nd row | N.Y. POLICE DEPT |
| 3rd row | N.Y. POLICE DEPT |
| 4th row | N.Y. POLICE DEPT |
| 5th row | N.Y. HOUSING POLICE |
| Value | Count | Frequency (%) |
| N.Y. POLICE DEPT | 372050 | |
| N.Y. HOUSING POLICE | 30660 | 7.4% |
| N.Y. TRANSIT POLICE | 7223 | 1.7% |
| OTHER | 1551 | 0.4% |
| PORT AUTHORITY | 1040 | 0.3% |
| NYC PARKS | 251 | 0.1% |
| DEPT OF CORRECTIONS | 213 | 0.1% |
| HEALTH & HOSP CORP | 150 | < 0.1% |
| TRI-BORO BRDG TUNNL | 87 | < 0.1% |
| N.Y. STATE POLICE | 79 | < 0.1% |
| Other values (8) | 108 | < 0.1% |
| Value | Count | Frequency (%) |
| police | 410030 | |
| n.y | 410027 | |
| dept | 372268 | |
| housing | 30660 | 2.5% |
| transit | 7223 | 0.6% |
| other | 1551 | 0.1% |
| port | 1040 | 0.1% |
| authority | 1040 | 0.1% |
| parks | 266 | < 0.1% |
| nyc | 251 | < 0.1% |
| Other values (31) | 1668 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 822612 | ||
| . | 820090 | |
| E | 784400 | |
| P | 783935 | |
| I | 449359 | |
| N | 448653 | |
| O | 445534 | |
| Y | 411365 | |
| C | 410908 | |
| L | 410285 | |
| Other values (16) | 921671 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5065873 | |
| Space Separator | 822612 | 12.3% |
| Other Punctuation | 820240 | 12.2% |
| Dash Punctuation | 87 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 784400 | |
| P | 783935 | |
| I | 449359 | |
| N | 448653 | |
| O | 445534 | |
| Y | 411365 | |
| C | 410908 | |
| L | 410285 | |
| T | 392231 | |
| D | 372385 | |
| Other values (12) | 156818 | 3.1% |
| Value | Count | Frequency (%) |
| . | 820090 | |
| & | 150 | < 0.1% |
| Value | Count | Frequency (%) |
| 822612 |
| Value | Count | Frequency (%) |
| - | 87 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5065873 | |
| Common | 1642939 | 24.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 784400 | |
| P | 783935 | |
| I | 449359 | |
| N | 448653 | |
| O | 445534 | |
| Y | 411365 | |
| C | 410908 | |
| L | 410285 | |
| T | 392231 | |
| D | 372385 | |
| Other values (12) | 156818 | 3.1% |
| Value | Count | Frequency (%) |
| 822612 | ||
| . | 820090 | |
| & | 150 | < 0.1% |
| - | 87 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6708812 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 822612 | ||
| . | 820090 | |
| E | 784400 | |
| P | 783935 | |
| I | 449359 | |
| N | 448653 | |
| O | 445534 | |
| Y | 411365 | |
| C | 410908 | |
| L | 410285 | |
| Other values (16) | 921671 |
KY_CD
Real number (ℝ≥0)
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 304.4962048 |
|---|---|
| Minimum | 101 |
| Maximum | 678 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 106 |
| Q1 | 117 |
| median | 341 |
| Q3 | 351 |
| 95-th percentile | 578 |
| Maximum | 678 |
| Range | 577 |
| Interquartile range (IQR) | 234 |
Descriptive statistics
| Standard deviation | 159.5255018 |
|---|---|
| Coefficient of variation (CV) | 0.5238998035 |
| Kurtosis | -0.8573247433 |
| Mean | 304.4962048 |
| Median Absolute Deviation (MAD) | 108 |
| Skewness | 0.2539769354 |
| Sum | 125882385 |
| Variance | 25448.38573 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 341 | 82061 | |
| 578 | 66736 | |
| 344 | 43532 | |
| 351 | 35711 | |
| 109 | 35482 | |
| 106 | 20554 | 5.0% |
| 361 | 15685 | 3.8% |
| 107 | 15468 | 3.7% |
| 105 | 13100 | 3.2% |
| 126 | 12332 | 3.0% |
| Other values (54) | 72751 |
| Value | Count | Frequency (%) |
| 101 | 463 | 0.1% |
| 102 | 1 | < 0.1% |
| 103 | 13 | < 0.1% |
| 104 | 1426 | 0.3% |
| 105 | 13100 |
| Value | Count | Frequency (%) |
| 678 | 438 | 0.1% |
| 677 | 11 | < 0.1% |
| 676 | 1 | < 0.1% |
| 675 | 47 | < 0.1% |
| 578 | 66736 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| MISDEMEANOR | |
|---|---|
| FELONY | |
| VIOLATION |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 9.042037967 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3738087 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FELONY |
|---|---|
| 2nd row | FELONY |
| 3rd row | FELONY |
| 4th row | FELONY |
| 5th row | FELONY |
| Value | Count | Frequency (%) |
| MISDEMEANOR | 211170 | |
| FELONY | 134987 | |
| VIOLATION | 67255 | 16.3% |
| Value | Count | Frequency (%) |
| misdemeanor | 211170 | |
| felony | 134987 | |
| violation | 67255 | 16.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 557327 | |
| O | 480667 | |
| M | 422340 | |
| N | 413412 | |
| I | 345680 | |
| A | 278425 | |
| S | 211170 | 5.6% |
| D | 211170 | 5.6% |
| R | 211170 | 5.6% |
| L | 202242 | 5.4% |
| Other values (4) | 404484 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3738087 |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 557327 | |
| O | 480667 | |
| M | 422340 | |
| N | 413412 | |
| I | 345680 | |
| A | 278425 | |
| S | 211170 | 5.6% |
| D | 211170 | 5.6% |
| R | 211170 | 5.6% |
| L | 202242 | 5.4% |
| Other values (4) | 404484 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3738087 |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 557327 | |
| O | 480667 | |
| M | 422340 | |
| N | 413412 | |
| I | 345680 | |
| A | 278425 | |
| S | 211170 | 5.6% |
| D | 211170 | 5.6% |
| R | 211170 | 5.6% |
| L | 202242 | 5.4% |
| Other values (4) | 404484 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3738087 |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 557327 | |
| O | 480667 | |
| M | 422340 | |
| N | 413412 | |
| I | 345680 | |
| A | 278425 | |
| S | 211170 | 5.6% |
| D | 211170 | 5.6% |
| R | 211170 | 5.6% |
| L | 202242 | 5.4% |
| Other values (4) | 404484 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 66086 |
| Missing (%) | 16.0% |
| Memory size | 3.2 MiB |
| INSIDE | |
|---|---|
| FRONT OF | |
| OPPOSITE OF | 10027 |
| REAR OF | 8377 |
| OUTSIDE | 322 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.812435579 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2366136 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OUTSIDE |
|---|---|
| 2nd row | INSIDE |
| 3rd row | INSIDE |
| 4th row | OUTSIDE |
| 5th row | OUTSIDE |
| Value | Count | Frequency (%) |
| INSIDE | 216927 | |
| FRONT OF | 111673 | |
| OPPOSITE OF | 10027 | 2.4% |
| REAR OF | 8377 | 2.0% |
| OUTSIDE | 322 | 0.1% |
| (Missing) | 66086 | 16.0% |
| Value | Count | Frequency (%) |
| inside | 216927 | |
| of | 130077 | |
| front | 111673 | |
| opposite | 10027 | 2.1% |
| rear | 8377 | 1.8% |
| outside | 322 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 444203 | |
| N | 328600 | |
| O | 262126 | |
| F | 241750 | |
| E | 235653 | |
| S | 227276 | |
| D | 217249 | |
| 130077 | 5.5% | |
| R | 128427 | 5.4% |
| T | 122022 | 5.2% |
| Other values (3) | 28753 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2236059 | |
| Space Separator | 130077 | 5.5% |
Most frequent character per category
| Value | Count | Frequency (%) |
| I | 444203 | |
| N | 328600 | |
| O | 262126 | |
| F | 241750 | |
| E | 235653 | |
| S | 227276 | |
| D | 217249 | |
| R | 128427 | 5.7% |
| T | 122022 | 5.5% |
| P | 20054 | 0.9% |
| Other values (2) | 8699 | 0.4% |
| Value | Count | Frequency (%) |
| 130077 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2236059 | |
| Common | 130077 | 5.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| I | 444203 | |
| N | 328600 | |
| O | 262126 | |
| F | 241750 | |
| E | 235653 | |
| S | 227276 | |
| D | 217249 | |
| R | 128427 | 5.7% |
| T | 122022 | 5.5% |
| P | 20054 | 0.9% |
| Other values (2) | 8699 | 0.4% |
| Value | Count | Frequency (%) |
| 130077 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2366136 |
Most frequent character per block
| Value | Count | Frequency (%) |
| I | 444203 | |
| N | 328600 | |
| O | 262126 | |
| F | 241750 | |
| E | 235653 | |
| S | 227276 | |
| D | 217249 | |
| 130077 | 5.5% | |
| R | 128427 | 5.4% |
| T | 122022 | 5.2% |
| Other values (3) | 28753 | 1.2% |
| Distinct | 59 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 3.2 MiB |
| PETIT LARCENY | |
|---|---|
| HARRASSMENT 2 | |
| CRIMINAL MISCHIEF & RELATED OF | |
| ASSAULT 3 & RELATED OFFENSES | |
| GRAND LARCENY | |
| Other values (54) |
Length
| Max length | 36 |
|---|---|
| Median length | 13 |
| Mean length | 18.20467773 |
| Min length | 4 |
Characters and Unicode
| Total characters | 7525923 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MURDER & NON-NEGL. MANSLAUGHTER |
|---|---|
| 2nd row | MURDER & NON-NEGL. MANSLAUGHTER |
| 3rd row | RAPE |
| 4th row | MURDER & NON-NEGL. MANSLAUGHTER |
| 5th row | MURDER & NON-NEGL. MANSLAUGHTER |
| Value | Count | Frequency (%) |
| PETIT LARCENY | 82061 | |
| HARRASSMENT 2 | 66736 | |
| CRIMINAL MISCHIEF & RELATED OF | 47271 | |
| ASSAULT 3 & RELATED OFFENSES | 43532 | |
| GRAND LARCENY | 35482 | |
| FELONY ASSAULT | 20554 | 5.0% |
| OFF. AGNST PUB ORD SENSBLTY & | 15685 | 3.8% |
| BURGLARY | 15468 | 3.7% |
| ROBBERY | 13100 | 3.2% |
| MISCELLANEOUS PENAL LAW | 12765 | 3.1% |
| Other values (49) | 60752 |
| Value | Count | Frequency (%) |
| larceny | 126745 | 10.5% |
| 109723 | 9.1% | |
| related | 91742 | 7.6% |
| petit | 82224 | 6.8% |
| harrassment | 66736 | 5.5% |
| 2 | 66736 | 5.5% |
| assault | 64086 | 5.3% |
| of | 59233 | 4.9% |
| offenses | 51788 | 4.3% |
| criminal | 49410 | 4.1% |
| Other values (104) | 440054 |
Most occurring characters
| Value | Count | Frequency (%) |
| 795071 | 10.6% | |
| E | 788876 | 10.5% |
| A | 721832 | 9.6% |
| R | 588569 | 7.8% |
| S | 556691 | 7.4% |
| N | 470878 | 6.3% |
| L | 470049 | 6.2% |
| T | 463775 | 6.2% |
| I | 363745 | 4.8% |
| F | 286247 | 3.8% |
| Other values (25) | 2020190 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6490518 | |
| Space Separator | 795071 | 10.6% |
| Other Punctuation | 126136 | 1.7% |
| Decimal Number | 110272 | 1.5% |
| Dash Punctuation | 3661 | < 0.1% |
| Open Punctuation | 265 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 788876 | |
| A | 721832 | |
| R | 588569 | 9.1% |
| S | 556691 | 8.6% |
| N | 470878 | 7.3% |
| L | 470049 | 7.2% |
| T | 463775 | 7.1% |
| I | 363745 | 5.6% |
| F | 286247 | 4.4% |
| C | 274897 | 4.2% |
| Other values (15) | 1504959 |
| Value | Count | Frequency (%) |
| & | 109723 | |
| . | 16152 | 12.8% |
| ' | 229 | 0.2% |
| / | 18 | < 0.1% |
| , | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 66736 | |
| 3 | 43536 |
| Value | Count | Frequency (%) |
| 795071 |
| Value | Count | Frequency (%) |
| - | 3661 |
| Value | Count | Frequency (%) |
| ( | 265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6490518 | |
| Common | 1035405 | 13.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 788876 | |
| A | 721832 | |
| R | 588569 | 9.1% |
| S | 556691 | 8.6% |
| N | 470878 | 7.3% |
| L | 470049 | 7.2% |
| T | 463775 | 7.1% |
| I | 363745 | 5.6% |
| F | 286247 | 4.4% |
| C | 274897 | 4.2% |
| Other values (15) | 1504959 |
| Value | Count | Frequency (%) |
| 795071 | ||
| & | 109723 | 10.6% |
| 2 | 66736 | 6.4% |
| 3 | 43536 | 4.2% |
| . | 16152 | 1.6% |
| - | 3661 | 0.4% |
| ( | 265 | < 0.1% |
| ' | 229 | < 0.1% |
| / | 18 | < 0.1% |
| , | 14 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7525923 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 795071 | 10.6% | |
| E | 788876 | 10.5% |
| A | 721832 | 9.6% |
| R | 588569 | 7.8% |
| S | 556691 | 7.4% |
| N | 470878 | 6.3% |
| L | 470049 | 6.2% |
| T | 463775 | 6.2% |
| I | 363745 | 4.8% |
| F | 286247 | 3.8% |
| Other values (25) | 2020190 |
| Distinct | 508 |
|---|---|
| Distinct (%) | 19.0% |
| Missing | 410736 |
| Missing (%) | 99.4% |
| Memory size | 3.2 MiB |
| WASHINGTON SQUARE PARK | 200 |
|---|---|
| CENTRAL PARK | 184 |
| FLUSHING MEADOWS CORONA PARK | 100 |
| CONEY ISLAND BEACH & BOARDWALK | 70 |
| UNION SQUARE PARK | 66 |
| Other values (503) |
Length
| Max length | 59 |
|---|---|
| Median length | 17 |
| Mean length | 18.51606876 |
| Min length | 7 |
Characters and Unicode
| Total characters | 49549 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 246 ? |
|---|---|
| Unique (%) | 9.2% |
Sample
| 1st row | MARCUS GARVEY PARK |
|---|---|
| 2nd row | BROOKVILLE PARK |
| 3rd row | WAYANDA PARK |
| 4th row | ASPHALT GREEN |
| 5th row | WASHINGTON SQUARE PARK |
| Value | Count | Frequency (%) |
| WASHINGTON SQUARE PARK | 200 | < 0.1% |
| CENTRAL PARK | 184 | < 0.1% |
| FLUSHING MEADOWS CORONA PARK | 100 | < 0.1% |
| CONEY ISLAND BEACH & BOARDWALK | 70 | < 0.1% |
| UNION SQUARE PARK | 66 | < 0.1% |
| PROSPECT PARK | 64 | < 0.1% |
| RIVERSIDE PARK | 64 | < 0.1% |
| HUDSON RIVER PARK | 51 | < 0.1% |
| MARCUS GARVEY PARK | 45 | < 0.1% |
| BRYANT PARK | 40 | < 0.1% |
| Other values (498) | 1792 | 0.4% |
| (Missing) | 410736 |
| Value | Count | Frequency (%) |
| park | 2045 | |
| square | 357 | 4.7% |
| playground | 317 | 4.1% |
| washington | 208 | 2.7% |
| central | 185 | 2.4% |
| beach | 126 | 1.6% |
| corona | 104 | 1.4% |
| boardwalk | 104 | 1.4% |
| flushing | 101 | 1.3% |
| meadows | 100 | 1.3% |
| Other values (676) | 4024 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 5782 | 11.7% |
| R | 5475 | 11.0% |
| 4995 | 10.1% | |
| E | 3207 | 6.5% |
| N | 3057 | 6.2% |
| P | 2804 | 5.7% |
| O | 2664 | 5.4% |
| K | 2544 | 5.1% |
| S | 2259 | 4.6% |
| L | 1896 | 3.8% |
| Other values (35) | 14866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 44059 | |
| Space Separator | 4995 | 10.1% |
| Other Punctuation | 399 | 0.8% |
| Decimal Number | 35 | 0.1% |
| Open Punctuation | 21 | < 0.1% |
| Close Punctuation | 21 | < 0.1% |
| Dash Punctuation | 19 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 5782 | |
| R | 5475 | |
| E | 3207 | 7.3% |
| N | 3057 | 6.9% |
| P | 2804 | 6.4% |
| O | 2664 | 6.0% |
| K | 2544 | 5.8% |
| S | 2259 | 5.1% |
| L | 1896 | 4.3% |
| I | 1697 | 3.9% |
| Other values (16) | 12674 |
| Value | Count | Frequency (%) |
| 4 | 8 | |
| 1 | 7 | |
| 5 | 6 | |
| 2 | 4 | |
| 7 | 3 | 8.6% |
| 9 | 2 | 5.7% |
| 8 | 2 | 5.7% |
| 6 | 2 | 5.7% |
| 3 | 1 | 2.9% |
| Value | Count | Frequency (%) |
| . | 202 | |
| ' | 81 | |
| & | 76 | 19.0% |
| / | 31 | 7.8% |
| " | 8 | 2.0% |
| , | 1 | 0.3% |
| Value | Count | Frequency (%) |
| 4995 |
| Value | Count | Frequency (%) |
| - | 19 |
| Value | Count | Frequency (%) |
| ( | 21 |
| Value | Count | Frequency (%) |
| ) | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44059 | |
| Common | 5490 | 11.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 5782 | |
| R | 5475 | |
| E | 3207 | 7.3% |
| N | 3057 | 6.9% |
| P | 2804 | 6.4% |
| O | 2664 | 6.0% |
| K | 2544 | 5.8% |
| S | 2259 | 5.1% |
| L | 1896 | 4.3% |
| I | 1697 | 3.9% |
| Other values (16) | 12674 |
| Value | Count | Frequency (%) |
| 4995 | ||
| . | 202 | 3.7% |
| ' | 81 | 1.5% |
| & | 76 | 1.4% |
| / | 31 | 0.6% |
| ( | 21 | 0.4% |
| ) | 21 | 0.4% |
| - | 19 | 0.3% |
| 4 | 8 | 0.1% |
| " | 8 | 0.1% |
| Other values (9) | 28 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49549 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 5782 | 11.7% |
| R | 5475 | 11.0% |
| 4995 | 10.1% | |
| E | 3207 | 6.5% |
| N | 3057 | 6.2% |
| P | 2804 | 5.7% |
| O | 2664 | 5.4% |
| K | 2544 | 5.1% |
| S | 2259 | 4.6% |
| L | 1896 | 3.8% |
| Other values (35) | 14866 |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 463 |
| Missing (%) | 0.1% |
| Memory size | 3.2 MiB |
| PATROL BORO BRONX | |
|---|---|
| PATROL BORO BKLYN NORTH | |
| PATROL BORO BKLYN SOUTH | |
| PATROL BORO MAN NORTH | |
| PATROL BORO QUEENS NORTH | |
| Other values (3) |
Length
| Max length | 25 |
|---|---|
| Median length | 23 |
| Mean length | 21.51259841 |
| Min length | 17 |
Characters and Unicode
| Total characters | 8883606 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PATROL BORO BRONX |
|---|---|
| 2nd row | PATROL BORO BRONX |
| 3rd row | PATROL BORO BRONX |
| 4th row | PATROL BORO BRONX |
| 5th row | PATROL BORO QUEENS SOUTH |
| Value | Count | Frequency (%) |
| PATROL BORO BRONX | 90442 | |
| PATROL BORO BKLYN NORTH | 60579 | |
| PATROL BORO BKLYN SOUTH | 58634 | |
| PATROL BORO MAN NORTH | 51366 | |
| PATROL BORO QUEENS NORTH | 46213 | |
| PATROL BORO MAN SOUTH | 45916 | |
| PATROL BORO QUEENS SOUTH | 42816 | |
| PATROL BORO STATEN ISLAND | 16983 | 4.1% |
| (Missing) | 463 | 0.1% |
| Value | Count | Frequency (%) |
| patrol | 412949 | |
| boro | 412949 | |
| north | 158158 | 10.1% |
| south | 147366 | 9.4% |
| bklyn | 119213 | 7.6% |
| man | 97282 | 6.2% |
| bronx | 90442 | 5.8% |
| queens | 89029 | 5.7% |
| island | 16983 | 1.1% |
| staten | 16983 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 1634813 | |
| 1148405 | ||
| R | 1074498 | |
| T | 752439 | |
| B | 622604 | 7.0% |
| N | 588090 | 6.6% |
| L | 549145 | 6.2% |
| A | 544197 | 6.1% |
| P | 412949 | 4.6% |
| H | 305524 | 3.4% |
| Other values (10) | 1250942 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7735201 | |
| Space Separator | 1148405 | 12.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| O | 1634813 | |
| R | 1074498 | |
| T | 752439 | |
| B | 622604 | 8.0% |
| N | 588090 | 7.6% |
| L | 549145 | 7.1% |
| A | 544197 | 7.0% |
| P | 412949 | 5.3% |
| H | 305524 | 3.9% |
| S | 270361 | 3.5% |
| Other values (9) | 980581 |
| Value | Count | Frequency (%) |
| 1148405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7735201 | |
| Common | 1148405 | 12.9% |
Most frequent character per script
| Value | Count | Frequency (%) |
| O | 1634813 | |
| R | 1074498 | |
| T | 752439 | |
| B | 622604 | 8.0% |
| N | 588090 | 7.6% |
| L | 549145 | 7.1% |
| A | 544197 | 7.0% |
| P | 412949 | 5.3% |
| H | 305524 | 3.9% |
| S | 270361 | 3.5% |
| Other values (9) | 980581 |
| Value | Count | Frequency (%) |
| 1148405 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8883606 |
Most frequent character per block
| Value | Count | Frequency (%) |
| O | 1634813 | |
| 1148405 | ||
| R | 1074498 | |
| T | 752439 | |
| B | 622604 | 7.0% |
| N | 588090 | 6.6% |
| L | 549145 | 6.2% |
| A | 544197 | 6.1% |
| P | 412949 | 4.6% |
| H | 305524 | 3.4% |
| Other values (10) | 1250942 |
PD_CD
Real number (ℝ≥0)
| Distinct | 345 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 463 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 390.6094578 |
|---|---|
| Minimum | 100 |
| Maximum | 922 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 101 |
| Q1 | 254 |
| median | 343 |
| Q3 | 637 |
| 95-th percentile | 748 |
| Maximum | 922 |
| Range | 822 |
| Interquartile range (IQR) | 383 |
Descriptive statistics
| Standard deviation | 210.3504185 |
|---|---|
| Coefficient of variation (CV) | 0.5385184979 |
| Kurtosis | -0.6739598702 |
| Mean | 390.6094578 |
| Median Absolute Deviation (MAD) | 135 |
| Skewness | 0.4442739057 |
| Sum | 161301785 |
| Variance | 44247.29856 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 638 | 47900 | 11.6% |
| 101 | 33373 | 8.1% |
| 333 | 28168 | 6.8% |
| 637 | 18836 | 4.6% |
| 109 | 15861 | 3.8% |
| 639 | 14444 | 3.5% |
| 254 | 14002 | 3.4% |
| 321 | 12842 | 3.1% |
| 259 | 12555 | 3.0% |
| 352 | 9983 | 2.4% |
| Other values (335) | 204985 |
| Value | Count | Frequency (%) |
| 100 | 8 | < 0.1% |
| 101 | 33373 | |
| 102 | 13 | < 0.1% |
| 103 | 36 | < 0.1% |
| 104 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 922 | 244 | 0.1% |
| 918 | 16 | < 0.1% |
| 916 | 6056 | |
| 907 | 32 | < 0.1% |
| 905 | 2349 | 0.6% |
| Distinct | 336 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 463 |
| Missing (%) | 0.1% |
| Memory size | 3.2 MiB |
| HARASSMENT,SUBD 3,4,5 | |
|---|---|
| ASSAULT 3 | |
| LARCENY,PETIT FROM STORE-SHOPL | |
| HARASSMENT,SUBD 1,CIVILIAN | 18836 |
| ASSAULT 2,1,UNCLASSIFIED | 15861 |
| Other values (331) |
Length
| Max length | 71 |
|---|---|
| Median length | 26 |
| Mean length | 26.59849521 |
| Min length | 6 |
Characters and Unicode
| Total characters | 10983822 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 27 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | RAPE 1 |
|---|---|
| 2nd row | WEAPONS POSSESSION 3 |
| 3rd row | HARASSMENT,SUBD 3,4,5 |
| 4th row | ARSON 1 |
| 5th row | FORGERY,ETC.-MISD. |
| Value | Count | Frequency (%) |
| HARASSMENT,SUBD 3,4,5 | 47900 | 11.6% |
| ASSAULT 3 | 33373 | 8.1% |
| LARCENY,PETIT FROM STORE-SHOPL | 28168 | 6.8% |
| HARASSMENT,SUBD 1,CIVILIAN | 18836 | 4.6% |
| ASSAULT 2,1,UNCLASSIFIED | 15861 | 3.8% |
| AGGRAVATED HARASSMENT 2 | 14444 | 3.5% |
| MISCHIEF, CRIMINAL 4, OF MOTOR | 14002 | 3.4% |
| LARCENY,PETIT FROM AUTO | 12842 | 3.1% |
| CRIMINAL MISCHIEF,UNCLASSIFIED 4 | 12555 | 3.0% |
| LARCENY,PETIT FROM BUILDING,UNATTENDED, PACKAGE THEFT INSIDE | 9983 | 2.4% |
| Other values (326) | 204985 |
| Value | Count | Frequency (%) |
| from | 87018 | 7.1% |
| larceny,petit | 81807 | 6.7% |
| harassment,subd | 66736 | 5.5% |
| of | 52734 | 4.3% |
| criminal | 51321 | 4.2% |
| assault | 50818 | 4.2% |
| 3,4,5 | 47900 | 3.9% |
| 3 | 44699 | 3.7% |
| larceny,grand | 44063 | 3.6% |
| store-shopl | 30919 | 2.5% |
| Other values (476) | 665877 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 888683 | 8.1% |
| A | 880169 | 8.0% |
| 830564 | 7.6% | |
| S | 746812 | 6.8% |
| I | 710698 | 6.5% |
| N | 688398 | 6.3% |
| R | 688291 | 6.3% |
| T | 630642 | 5.7% |
| , | 550538 | 5.0% |
| C | 541448 | 4.9% |
| Other values (30) | 3827579 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9171067 | |
| Space Separator | 830564 | 7.6% |
| Other Punctuation | 580346 | 5.3% |
| Decimal Number | 337236 | 3.1% |
| Dash Punctuation | 56782 | 0.5% |
| Open Punctuation | 4015 | < 0.1% |
| Close Punctuation | 3812 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 888683 | 9.7% |
| A | 880169 | 9.6% |
| S | 746812 | 8.1% |
| I | 710698 | 7.7% |
| N | 688398 | 7.5% |
| R | 688291 | 7.5% |
| T | 630642 | 6.9% |
| C | 541448 | 5.9% |
| L | 508484 | 5.5% |
| O | 454811 | 5.0% |
| Other values (16) | 2432631 |
| Value | Count | Frequency (%) |
| 3 | 96291 | |
| 4 | 85730 | |
| 1 | 53953 | |
| 2 | 51774 | |
| 5 | 49380 | |
| 7 | 108 | < 0.1% |
| Value | Count | Frequency (%) |
| , | 550538 | |
| / | 14794 | 2.5% |
| & | 9128 | 1.6% |
| . | 5886 | 1.0% |
| Value | Count | Frequency (%) |
| 830564 |
| Value | Count | Frequency (%) |
| - | 56782 |
| Value | Count | Frequency (%) |
| ( | 4015 |
| Value | Count | Frequency (%) |
| ) | 3812 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9171067 | |
| Common | 1812755 | 16.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 888683 | 9.7% |
| A | 880169 | 9.6% |
| S | 746812 | 8.1% |
| I | 710698 | 7.7% |
| N | 688398 | 7.5% |
| R | 688291 | 7.5% |
| T | 630642 | 6.9% |
| C | 541448 | 5.9% |
| L | 508484 | 5.5% |
| O | 454811 | 5.0% |
| Other values (16) | 2432631 |
| Value | Count | Frequency (%) |
| 830564 | ||
| , | 550538 | |
| 3 | 96291 | 5.3% |
| 4 | 85730 | 4.7% |
| - | 56782 | 3.1% |
| 1 | 53953 | 3.0% |
| 2 | 51774 | 2.9% |
| 5 | 49380 | 2.7% |
| / | 14794 | 0.8% |
| & | 9128 | 0.5% |
| Other values (4) | 13821 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10983822 |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 888683 | 8.1% |
| A | 880169 | 8.0% |
| 830564 | 7.6% | |
| S | 746812 | 6.8% |
| I | 710698 | 6.5% |
| N | 688398 | 6.3% |
| R | 688291 | 6.3% |
| T | 630642 | 5.7% |
| , | 550538 | 5.0% |
| C | 541448 | 4.9% |
| Other values (30) | 3827579 |
| Distinct | 74 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1172 |
| Missing (%) | 0.3% |
| Memory size | 3.2 MiB |
| STREET | |
|---|---|
| RESIDENCE - APT. HOUSE | |
| RESIDENCE-HOUSE | |
| RESIDENCE - PUBLIC HOUSING | |
| CHAIN STORE | |
| Other values (69) |
Length
| Max length | 28 |
|---|---|
| Median length | 15 |
| Mean length | 14.54387978 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5995569 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | STREET |
|---|---|
| 2nd row | STREET |
| 3rd row | RESIDENCE - APT. HOUSE |
| 4th row | OTHER |
| 5th row | STREET |
| Value | Count | Frequency (%) |
| STREET | 123143 | |
| RESIDENCE - APT. HOUSE | 99028 | |
| RESIDENCE-HOUSE | 44419 | 10.7% |
| RESIDENCE - PUBLIC HOUSING | 30613 | 7.4% |
| CHAIN STORE | 14703 | 3.6% |
| OTHER | 9283 | 2.2% |
| COMMERCIAL BUILDING | 8933 | 2.2% |
| DRUG STORE | 8594 | 2.1% |
| GROCERY/BODEGA | 7194 | 1.7% |
| TRANSIT - NYC SUBWAY | 7141 | 1.7% |
| Other values (64) | 59189 |
| Value | Count | Frequency (%) |
| 137878 | ||
| residence | 129641 | |
| street | 123143 | |
| house | 99058 | |
| apt | 99028 | |
| residence-house | 44419 | 4.9% |
| public | 34962 | 3.9% |
| store | 33611 | 3.7% |
| housing | 30613 | 3.4% |
| chain | 14703 | 1.6% |
| Other values (92) | 155139 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1050619 | |
| S | 559334 | 9.3% |
| 489955 | 8.2% | |
| T | 475201 | 7.9% |
| R | 440756 | 7.4% |
| I | 334220 | 5.6% |
| N | 291622 | 4.9% |
| O | 290828 | 4.9% |
| C | 276778 | 4.6% |
| U | 262111 | 4.4% |
| Other values (22) | 1524145 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5179898 | |
| Space Separator | 489955 | 8.2% |
| Dash Punctuation | 181201 | 3.0% |
| Other Punctuation | 132395 | 2.2% |
| Open Punctuation | 6060 | 0.1% |
| Close Punctuation | 6060 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 1050619 | |
| S | 559334 | |
| T | 475201 | |
| R | 440756 | |
| I | 334220 | 6.5% |
| N | 291622 | 5.6% |
| O | 290828 | 5.6% |
| C | 276778 | 5.3% |
| U | 262111 | 5.1% |
| D | 227371 | 4.4% |
| Other values (15) | 971058 |
| Value | Count | Frequency (%) |
| . | 99833 | |
| / | 31466 | 23.8% |
| & | 1096 | 0.8% |
| Value | Count | Frequency (%) |
| 489955 |
| Value | Count | Frequency (%) |
| - | 181201 |
| Value | Count | Frequency (%) |
| ( | 6060 |
| Value | Count | Frequency (%) |
| ) | 6060 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5179898 | |
| Common | 815671 | 13.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 1050619 | |
| S | 559334 | |
| T | 475201 | |
| R | 440756 | |
| I | 334220 | 6.5% |
| N | 291622 | 5.6% |
| O | 290828 | 5.6% |
| C | 276778 | 5.3% |
| U | 262111 | 5.1% |
| D | 227371 | 4.4% |
| Other values (15) | 971058 |
| Value | Count | Frequency (%) |
| 489955 | ||
| - | 181201 | 22.2% |
| . | 99833 | 12.2% |
| / | 31466 | 3.9% |
| ( | 6060 | 0.7% |
| ) | 6060 | 0.7% |
| & | 1096 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5995569 |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 1050619 | |
| S | 559334 | 9.3% |
| 489955 | 8.2% | |
| T | 475201 | 7.9% |
| R | 440756 | 7.4% |
| I | 334220 | 5.6% |
| N | 291622 | 4.9% |
| O | 290828 | 4.9% |
| C | 276778 | 4.6% |
| U | 262111 | 4.4% |
| Other values (22) | 1524145 |
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| 06/02/2020 | 1502 |
|---|---|
| 01/15/2020 | 1500 |
| 01/14/2020 | 1476 |
| 03/11/2020 | 1441 |
| 10/21/2020 | 1429 |
| Other values (361) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 4134120 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 12/23/2020 |
|---|---|
| 2nd row | 12/21/2020 |
| 3rd row | 11/23/2020 |
| 4th row | 11/22/2020 |
| 5th row | 11/21/2020 |
| Value | Count | Frequency (%) |
| 06/02/2020 | 1502 | 0.4% |
| 01/15/2020 | 1500 | 0.4% |
| 01/14/2020 | 1476 | 0.4% |
| 03/11/2020 | 1441 | 0.3% |
| 10/21/2020 | 1429 | 0.3% |
| 02/05/2020 | 1420 | 0.3% |
| 09/08/2020 | 1417 | 0.3% |
| 02/03/2020 | 1416 | 0.3% |
| 01/02/2020 | 1415 | 0.3% |
| 02/18/2020 | 1412 | 0.3% |
| Other values (356) | 398984 |
| Value | Count | Frequency (%) |
| 06/02/2020 | 1502 | 0.4% |
| 01/15/2020 | 1500 | 0.4% |
| 01/14/2020 | 1476 | 0.4% |
| 03/11/2020 | 1441 | 0.3% |
| 10/21/2020 | 1429 | 0.3% |
| 02/05/2020 | 1420 | 0.3% |
| 09/08/2020 | 1417 | 0.3% |
| 02/03/2020 | 1416 | 0.3% |
| 01/02/2020 | 1415 | 0.3% |
| 02/18/2020 | 1412 | 0.3% |
| Other values (356) | 398984 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1334142 | |
| 2 | 1072786 | |
| / | 826824 | |
| 1 | 366157 | 8.9% |
| 3 | 93678 | 2.3% |
| 8 | 78591 | 1.9% |
| 7 | 76493 | 1.9% |
| 9 | 76006 | 1.8% |
| 5 | 72256 | 1.7% |
| 6 | 72121 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3307296 | |
| Other Punctuation | 826824 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1334142 | |
| 2 | 1072786 | |
| 1 | 366157 | 11.1% |
| 3 | 93678 | 2.8% |
| 8 | 78591 | 2.4% |
| 7 | 76493 | 2.3% |
| 9 | 76006 | 2.3% |
| 5 | 72256 | 2.2% |
| 6 | 72121 | 2.2% |
| 4 | 65066 | 2.0% |
| Value | Count | Frequency (%) |
| / | 826824 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4134120 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1334142 | |
| 2 | 1072786 | |
| / | 826824 | |
| 1 | 366157 | 8.9% |
| 3 | 93678 | 2.3% |
| 8 | 78591 | 1.9% |
| 7 | 76493 | 1.9% |
| 9 | 76006 | 1.8% |
| 5 | 72256 | 1.7% |
| 6 | 72121 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4134120 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1334142 | |
| 2 | 1072786 | |
| / | 826824 | |
| 1 | 366157 | 8.9% |
| 3 | 93678 | 2.3% |
| 8 | 78591 | 1.9% |
| 7 | 76493 | 1.9% |
| 9 | 76006 | 1.8% |
| 5 | 72256 | 1.7% |
| 6 | 72121 | 1.7% |
| Distinct | 362 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 406179 |
| Missing (%) | 98.3% |
| Memory size | 3.2 MiB |
| 125 STREET | 250 |
|---|---|
| 34 ST.-PENN STATION | 148 |
| 14 STREET | 136 |
| 42 ST.-TIMES SQUARE | 133 |
| 59 ST.-COLUMBUS CIRCLE | 126 |
| Other values (357) |
Length
| Max length | 30 |
|---|---|
| Median length | 14 |
| Mean length | 15.60320752 |
| Min length | 6 |
Characters and Unicode
| Total characters | 112858 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 34 ST.-PENN STATION |
|---|---|
| 2nd row | WINTHROP STREET |
| 3rd row | PACIFIC STREET |
| 4th row | MAIN ST.-FLUSHING |
| 5th row | GRAND AVE.-NEWTON |
| Value | Count | Frequency (%) |
| 125 STREET | 250 | 0.1% |
| 34 ST.-PENN STATION | 148 | < 0.1% |
| 14 STREET | 136 | < 0.1% |
| 42 ST.-TIMES SQUARE | 133 | < 0.1% |
| 59 ST.-COLUMBUS CIRCLE | 126 | < 0.1% |
| 42 ST.-PORT AUTHORITY BUS TERM | 117 | < 0.1% |
| 161 ST.-YANKEE STADIUM | 97 | < 0.1% |
| 34 ST.-HERALD SQ. | 95 | < 0.1% |
| UTICA AVE.-CROWN HEIGHTS | 95 | < 0.1% |
| W. 4 STREET | 88 | < 0.1% |
| Other values (352) | 5948 | 1.4% |
| (Missing) | 406179 |
| Value | Count | Frequency (%) |
| street | 2581 | 14.9% |
| avenue | 1335 | 7.7% |
| 42 | 395 | 2.3% |
| 34 | 359 | 2.1% |
| 125 | 250 | 1.4% |
| square | 216 | 1.2% |
| east | 193 | 1.1% |
| 59 | 192 | 1.1% |
| road | 179 | 1.0% |
| st.-grand | 176 | 1.0% |
| Other values (433) | 11426 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 14002 | 12.4% |
| T | 11195 | 9.9% |
| 10069 | 8.9% | |
| S | 8183 | 7.3% |
| R | 7768 | 6.9% |
| A | 7598 | 6.7% |
| N | 5838 | 5.2% |
| O | 4144 | 3.7% |
| U | 3734 | 3.3% |
| L | 3179 | 2.8% |
| Other values (32) | 37148 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 89056 | |
| Space Separator | 10069 | 8.9% |
| Decimal Number | 8626 | 7.6% |
| Other Punctuation | 2704 | 2.4% |
| Dash Punctuation | 2403 | 2.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 14002 | |
| T | 11195 | |
| S | 8183 | 9.2% |
| R | 7768 | 8.7% |
| A | 7598 | 8.5% |
| N | 5838 | 6.6% |
| O | 4144 | 4.7% |
| U | 3734 | 4.2% |
| L | 3179 | 3.6% |
| I | 2812 | 3.2% |
| Other values (16) | 20603 |
| Value | Count | Frequency (%) |
| 1 | 1908 | |
| 4 | 1436 | |
| 2 | 1054 | |
| 5 | 879 | |
| 3 | 842 | |
| 6 | 592 | 6.9% |
| 7 | 573 | 6.6% |
| 9 | 534 | 6.2% |
| 8 | 435 | 5.0% |
| 0 | 373 | 4.3% |
| Value | Count | Frequency (%) |
| . | 2399 | |
| / | 240 | 8.9% |
| " | 56 | 2.1% |
| ' | 9 | 0.3% |
| Value | Count | Frequency (%) |
| 10069 |
| Value | Count | Frequency (%) |
| - | 2403 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 89056 | |
| Common | 23802 | 21.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 14002 | |
| T | 11195 | |
| S | 8183 | 9.2% |
| R | 7768 | 8.7% |
| A | 7598 | 8.5% |
| N | 5838 | 6.6% |
| O | 4144 | 4.7% |
| U | 3734 | 4.2% |
| L | 3179 | 3.6% |
| I | 2812 | 3.2% |
| Other values (16) | 20603 |
| Value | Count | Frequency (%) |
| 10069 | ||
| - | 2403 | 10.1% |
| . | 2399 | 10.1% |
| 1 | 1908 | 8.0% |
| 4 | 1436 | 6.0% |
| 2 | 1054 | 4.4% |
| 5 | 879 | 3.7% |
| 3 | 842 | 3.5% |
| 6 | 592 | 2.5% |
| 7 | 573 | 2.4% |
| Other values (6) | 1647 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 112858 |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 14002 | 12.4% |
| T | 11195 | 9.9% |
| 10069 | 8.9% | |
| S | 8183 | 7.3% |
| R | 7768 | 6.9% |
| A | 7598 | 6.7% |
| N | 5838 | 5.2% |
| O | 4144 | 3.7% |
| U | 3734 | 3.3% |
| L | 3179 | 2.8% |
| Other values (32) | 37148 |
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 94862 |
| Missing (%) | 22.9% |
| Memory size | 3.2 MiB |
| UNKNOWN | |
|---|---|
| 25-44 | |
| 45-64 | |
| 18-24 | |
| <18 | 6632 |
| Other values (12) | 3338 |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.845289593 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1862017 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | UNKNOWN |
|---|---|
| 2nd row | 25-44 |
| 3rd row | 25-44 |
| 4th row | <18 |
| 5th row | 25-44 |
| Value | Count | Frequency (%) |
| UNKNOWN | 144594 | |
| 25-44 | 100390 | |
| 45-64 | 33930 | 8.2% |
| 18-24 | 29666 | 7.2% |
| <18 | 6632 | 1.6% |
| 65+ | 3317 | 0.8% |
| 2020 | 10 | < 0.1% |
| 2019 | 2 | < 0.1% |
| 1925 | 1 | < 0.1% |
| -977 | 1 | < 0.1% |
| Other values (7) | 7 | < 0.1% |
| (Missing) | 94862 |
| Value | Count | Frequency (%) |
| unknown | 144594 | |
| 25-44 | 100390 | |
| 45-64 | 33930 | 10.7% |
| 18-24 | 29666 | 9.3% |
| 18 | 6632 | 2.1% |
| 65 | 3317 | 1.0% |
| 2020 | 10 | < 0.1% |
| 2019 | 2 | < 0.1% |
| 942 | 1 | < 0.1% |
| 1020 | 1 | < 0.1% |
| Other values (7) | 7 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 433782 | |
| 4 | 298307 | |
| - | 163993 | 8.8% |
| U | 144594 | 7.8% |
| K | 144594 | 7.8% |
| O | 144594 | 7.8% |
| W | 144594 | 7.8% |
| 5 | 137639 | 7.4% |
| 2 | 130084 | 7.0% |
| 6 | 37249 | 2.0% |
| Other values (7) | 82587 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1012158 | |
| Decimal Number | 675917 | |
| Dash Punctuation | 163993 | 8.8% |
| Math Symbol | 9949 | 0.5% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 4 | 298307 | |
| 5 | 137639 | |
| 2 | 130084 | |
| 6 | 37249 | 5.5% |
| 1 | 36304 | 5.4% |
| 8 | 36299 | 5.4% |
| 0 | 24 | < 0.1% |
| 9 | 8 | < 0.1% |
| 7 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| N | 433782 | |
| U | 144594 | 14.3% |
| K | 144594 | 14.3% |
| O | 144594 | 14.3% |
| W | 144594 | 14.3% |
| Value | Count | Frequency (%) |
| < | 6632 | |
| + | 3317 |
| Value | Count | Frequency (%) |
| - | 163993 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1012158 | |
| Common | 849859 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 4 | 298307 | |
| - | 163993 | |
| 5 | 137639 | |
| 2 | 130084 | |
| 6 | 37249 | 4.4% |
| 1 | 36304 | 4.3% |
| 8 | 36299 | 4.3% |
| < | 6632 | 0.8% |
| + | 3317 | 0.4% |
| 0 | 24 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| N | 433782 | |
| U | 144594 | 14.3% |
| K | 144594 | 14.3% |
| O | 144594 | 14.3% |
| W | 144594 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1862017 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 433782 | |
| 4 | 298307 | |
| - | 163993 | 8.8% |
| U | 144594 | 7.8% |
| K | 144594 | 7.8% |
| O | 144594 | 7.8% |
| W | 144594 | 7.8% |
| 5 | 137639 | 7.4% |
| 2 | 130084 | 7.0% |
| 6 | 37249 | 2.0% |
| Other values (7) | 82587 | 4.4% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 94862 |
| Missing (%) | 22.9% |
| Memory size | 3.2 MiB |
| BLACK | |
|---|---|
| UNKNOWN | |
| WHITE HISPANIC | |
| WHITE | |
| BLACK HISPANIC | |
| Other values (2) | 11481 |
Length
| Max length | 30 |
|---|---|
| Median length | 7 |
| Mean length | 8.180847591 |
| Min length | 5 |
Characters and Unicode
| Total characters | 2606009 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UNKNOWN |
|---|---|
| 2nd row | BLACK |
| 3rd row | BLACK |
| 4th row | WHITE HISPANIC |
| 5th row | WHITE |
| Value | Count | Frequency (%) |
| BLACK | 115253 | |
| UNKNOWN | 96066 | |
| WHITE HISPANIC | 50625 | |
| WHITE | 29164 | 7.1% |
| BLACK HISPANIC | 15961 | 3.9% |
| ASIAN / PACIFIC ISLANDER | 10862 | 2.6% |
| AMERICAN INDIAN/ALASKAN NATIVE | 619 | 0.1% |
| (Missing) | 94862 |
| Value | Count | Frequency (%) |
| black | 131214 | |
| unknown | 96066 | |
| white | 79789 | |
| hispanic | 66586 | |
| pacific | 10862 | 2.6% |
| islander | 10862 | 2.6% |
| asian | 10862 | 2.6% |
| 10862 | 2.6% | |
| american | 619 | 0.1% |
| native | 619 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 379603 | |
| I | 258885 | |
| A | 245581 | |
| K | 227899 | 8.7% |
| C | 220143 | 8.4% |
| W | 175855 | 6.7% |
| H | 146375 | 5.6% |
| L | 142695 | 5.5% |
| B | 131214 | 5.0% |
| 100410 | 3.9% | |
| Other values (12) | 577349 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2494118 | |
| Space Separator | 100410 | 3.9% |
| Other Punctuation | 11481 | 0.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| N | 379603 | |
| I | 258885 | |
| A | 245581 | |
| K | 227899 | |
| C | 220143 | |
| W | 175855 | 7.1% |
| H | 146375 | 5.9% |
| L | 142695 | 5.7% |
| B | 131214 | 5.3% |
| U | 96066 | 3.9% |
| Other values (10) | 469802 |
| Value | Count | Frequency (%) |
| 100410 |
| Value | Count | Frequency (%) |
| / | 11481 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2494118 | |
| Common | 111891 | 4.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| N | 379603 | |
| I | 258885 | |
| A | 245581 | |
| K | 227899 | |
| C | 220143 | |
| W | 175855 | 7.1% |
| H | 146375 | 5.9% |
| L | 142695 | 5.7% |
| B | 131214 | 5.3% |
| U | 96066 | 3.9% |
| Other values (10) | 469802 |
| Value | Count | Frequency (%) |
| 100410 | ||
| / | 11481 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2606009 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 379603 | |
| I | 258885 | |
| A | 245581 | |
| K | 227899 | 8.7% |
| C | 220143 | 8.4% |
| W | 175855 | 6.7% |
| H | 146375 | 5.6% |
| L | 142695 | 5.5% |
| B | 131214 | 5.0% |
| 100410 | 3.9% | |
| Other values (12) | 577349 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 94862 |
| Missing (%) | 22.9% |
| Memory size | 3.2 MiB |
| M | |
|---|---|
| U | |
| F |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 318550 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | U |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
| Value | Count | Frequency (%) |
| M | 186273 | |
| U | 82849 | |
| F | 49428 | 12.0% |
| (Missing) | 94862 |
| Value | Count | Frequency (%) |
| m | 186273 | |
| u | 82849 | |
| f | 49428 | 15.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 186273 | |
| U | 82849 | |
| F | 49428 | 15.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 318550 |
Most frequent character per category
| Value | Count | Frequency (%) |
| M | 186273 | |
| U | 82849 | |
| F | 49428 | 15.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 318550 |
Most frequent character per script
| Value | Count | Frequency (%) |
| M | 186273 | |
| U | 82849 | |
| F | 49428 | 15.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 318550 |
Most frequent character per block
| Value | Count | Frequency (%) |
| M | 186273 | |
| U | 82849 | |
| F | 49428 | 15.5% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 406179 |
| Missing (%) | 98.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.85275819 |
|---|---|
| Minimum | 1 |
| Maximum | 34 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 11 |
| Q3 | 30 |
| 95-th percentile | 33 |
| Maximum | 34 |
| Range | 33 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 12.57269892 |
|---|---|
| Coefficient of variation (CV) | 0.9075953503 |
| Kurtosis | -1.394793232 |
| Mean | 13.85275819 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.5144579587 |
| Sum | 100197 |
| Variance | 158.0727582 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1023 | 0.2% |
| 4 | 881 | 0.2% |
| 1 | 830 | 0.2% |
| 20 | 653 | 0.2% |
| 3 | 623 | 0.2% |
| 33 | 597 | 0.1% |
| 11 | 593 | 0.1% |
| 32 | 583 | 0.1% |
| 12 | 562 | 0.1% |
| 30 | 462 | 0.1% |
| Other values (2) | 426 | 0.1% |
| (Missing) | 406179 |
| Value | Count | Frequency (%) |
| 1 | 830 | |
| 2 | 1023 | |
| 3 | 623 | |
| 4 | 881 | |
| 11 | 593 |
| Value | Count | Frequency (%) |
| 34 | 326 | |
| 33 | 597 | |
| 32 | 583 | |
| 30 | 462 | |
| 23 | 100 | < 0.1% |
VIC_AGE_GROUP
Categorical
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 3.2 MiB |
| 25-44 | |
|---|---|
| UNKNOWN | |
| 45-64 | |
| 18-24 | |
| 65+ | |
| Other values (21) | 12257 |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.333735677 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2205025 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 19 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 18-24 |
|---|---|
| 2nd row | 25-44 |
| 3rd row | 25-44 |
| 4th row | 25-44 |
| 5th row | 18-24 |
| Value | Count | Frequency (%) |
| 25-44 | 159768 | |
| UNKNOWN | 100179 | |
| 45-64 | 84566 | |
| 18-24 | 37700 | 9.1% |
| 65+ | 18941 | 4.6% |
| <18 | 12236 | 3.0% |
| -948 | 2 | < 0.1% |
| -963 | 1 | < 0.1% |
| -958 | 1 | < 0.1% |
| 950 | 1 | < 0.1% |
| Other values (16) | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 25-44 | 159768 | |
| unknown | 100179 | |
| 45-64 | 84566 | |
| 18-24 | 37700 | 9.1% |
| 65 | 18941 | 4.6% |
| 18 | 12236 | 3.0% |
| 938 | 2 | < 0.1% |
| 948 | 2 | < 0.1% |
| 968 | 1 | < 0.1% |
| 973 | 1 | < 0.1% |
| Other values (15) | 15 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 526374 | |
| N | 300537 | |
| - | 282048 | |
| 5 | 263278 | |
| 2 | 197470 | 9.0% |
| 6 | 103513 | 4.7% |
| U | 100179 | 4.5% |
| K | 100179 | 4.5% |
| O | 100179 | 4.5% |
| W | 100179 | 4.5% |
| Other values (8) | 131089 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1190547 | |
| Uppercase Letter | 701253 | |
| Dash Punctuation | 282048 | 12.8% |
| Math Symbol | 31177 | 1.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 4 | 526374 | |
| 5 | 263278 | |
| 2 | 197470 | 16.6% |
| 6 | 103513 | 8.7% |
| 1 | 49942 | 4.2% |
| 8 | 49942 | 4.2% |
| 9 | 15 | < 0.1% |
| 3 | 7 | < 0.1% |
| 0 | 4 | < 0.1% |
| 7 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| N | 300537 | |
| U | 100179 | 14.3% |
| K | 100179 | 14.3% |
| O | 100179 | 14.3% |
| W | 100179 | 14.3% |
| Value | Count | Frequency (%) |
| + | 18941 | |
| < | 12236 |
| Value | Count | Frequency (%) |
| - | 282048 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1503772 | |
| Latin | 701253 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 4 | 526374 | |
| - | 282048 | |
| 5 | 263278 | |
| 2 | 197470 | 13.1% |
| 6 | 103513 | 6.9% |
| 1 | 49942 | 3.3% |
| 8 | 49942 | 3.3% |
| + | 18941 | 1.3% |
| < | 12236 | 0.8% |
| 9 | 15 | < 0.1% |
| Other values (3) | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| N | 300537 | |
| U | 100179 | 14.3% |
| K | 100179 | 14.3% |
| O | 100179 | 14.3% |
| W | 100179 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2205025 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 4 | 526374 | |
| N | 300537 | |
| - | 282048 | |
| 5 | 263278 | |
| 2 | 197470 | 9.0% |
| 6 | 103513 | 4.7% |
| U | 100179 | 4.5% |
| K | 100179 | 4.5% |
| O | 100179 | 4.5% |
| W | 100179 | 4.5% |
| Other values (8) | 131089 | 5.9% |
VIC_RACE
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 3.2 MiB |
| UNKNOWN | |
|---|---|
| BLACK | |
| WHITE HISPANIC | |
| WHITE | |
| ASIAN / PACIFIC ISLANDER | |
| Other values (2) |
Length
| Max length | 30 |
|---|---|
| Median length | 7 |
| Mean length | 9.147073977 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3781501 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BLACK |
|---|---|
| 2nd row | BLACK |
| 3rd row | BLACK |
| 4th row | BLACK |
| 5th row | BLACK HISPANIC |
| Value | Count | Frequency (%) |
| UNKNOWN | 109813 | |
| BLACK | 109704 | |
| WHITE HISPANIC | 76211 | |
| WHITE | 66066 | |
| ASIAN / PACIFIC ISLANDER | 32312 | 7.8% |
| BLACK HISPANIC | 17977 | 4.3% |
| AMERICAN INDIAN/ALASKAN NATIVE | 1328 | 0.3% |
| (Missing) | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| white | 142277 | |
| black | 127681 | |
| unknown | 109813 | |
| hispanic | 94188 | |
| pacific | 32312 | 5.3% |
| islander | 32312 | 5.3% |
| asian | 32312 | 5.3% |
| 32312 | 5.3% | |
| american | 1328 | 0.2% |
| native | 1328 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 494891 | |
| I | 465213 | |
| A | 360413 | 9.5% |
| C | 287821 | 7.6% |
| W | 252090 | 6.7% |
| K | 238822 | 6.3% |
| H | 236465 | 6.3% |
| 193780 | 5.1% | |
| E | 177245 | 4.7% |
| L | 161321 | 4.3% |
| Other values (12) | 913440 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3554081 | |
| Space Separator | 193780 | 5.1% |
| Other Punctuation | 33640 | 0.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| N | 494891 | |
| I | 465213 | |
| A | 360413 | |
| C | 287821 | 8.1% |
| W | 252090 | 7.1% |
| K | 238822 | 6.7% |
| H | 236465 | 6.7% |
| E | 177245 | 5.0% |
| L | 161321 | 4.5% |
| S | 160140 | 4.5% |
| Other values (10) | 719660 |
| Value | Count | Frequency (%) |
| 193780 |
| Value | Count | Frequency (%) |
| / | 33640 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3554081 | |
| Common | 227420 | 6.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| N | 494891 | |
| I | 465213 | |
| A | 360413 | |
| C | 287821 | 8.1% |
| W | 252090 | 7.1% |
| K | 238822 | 6.7% |
| H | 236465 | 6.7% |
| E | 177245 | 5.0% |
| L | 161321 | 4.5% |
| S | 160140 | 4.5% |
| Other values (10) | 719660 |
| Value | Count | Frequency (%) |
| 193780 | ||
| / | 33640 | 14.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3781501 |
Most frequent character per block
| Value | Count | Frequency (%) |
| N | 494891 | |
| I | 465213 | |
| A | 360413 | 9.5% |
| C | 287821 | 7.6% |
| W | 252090 | 6.7% |
| K | 238822 | 6.3% |
| H | 236465 | 6.3% |
| 193780 | 5.1% | |
| E | 177245 | 4.7% |
| L | 161321 | 4.3% |
| Other values (12) | 913440 |
VIC_SEX
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 3.2 MiB |
| F | |
|---|---|
| M | |
| D | |
| E |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 413411 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | F |
| 4th row | F |
| 5th row | M |
| Value | Count | Frequency (%) |
| F | 164503 | |
| M | 152219 | |
| D | 63160 | 15.3% |
| E | 33529 | 8.1% |
| (Missing) | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| f | 164503 | |
| m | 152219 | |
| d | 63160 | 15.3% |
| e | 33529 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 164503 | |
| M | 152219 | |
| D | 63160 | 15.3% |
| E | 33529 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 413411 |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 164503 | |
| M | 152219 | |
| D | 63160 | 15.3% |
| E | 33529 | 8.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 413411 |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 164503 | |
| M | 152219 | |
| D | 63160 | 15.3% |
| E | 33529 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 413411 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 164503 | |
| M | 152219 | |
| D | 63160 | 15.3% |
| E | 33529 | 8.1% |
| Distinct | 47204 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1005765.402 |
|---|---|
| Minimum | 913411 |
| Maximum | 1067185 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 913411 |
|---|---|
| 5-th percentile | 979467 |
| Q1 | 993253 |
| median | 1005026 |
| Q3 | 1017356 |
| 95-th percentile | 1043751.35 |
| Maximum | 1067185 |
| Range | 153774 |
| Interquartile range (IQR) | 24103 |
Descriptive statistics
| Standard deviation | 21261.6037 |
|---|---|
| Coefficient of variation (CV) | 0.02113972468 |
| Kurtosis | 1.507604224 |
| Mean | 1005765.402 |
| Median Absolute Deviation (MAD) | 12093 |
| Skewness | -0.2529144364 |
| Sum | 4.157954865 × 1011 |
| Variance | 452055791.7 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 987220 | 939 | 0.2% |
| 989211 | 537 | 0.1% |
| 1019840 | 412 | 0.1% |
| 991655 | 405 | 0.1% |
| 997880 | 399 | 0.1% |
| 1005725 | 386 | 0.1% |
| 1006537 | 326 | 0.1% |
| 1004138 | 310 | 0.1% |
| 1017141 | 304 | 0.1% |
| 1020754 | 302 | 0.1% |
| Other values (47194) | 409092 |
| Value | Count | Frequency (%) |
| 913411 | 1 | |
| 913512 | 2 | |
| 913784 | 1 | |
| 913819 | 1 | |
| 913853 | 1 |
| Value | Count | Frequency (%) |
| 1067185 | 10 | |
| 1067117 | 1 | < 0.1% |
| 1067083 | 1 | < 0.1% |
| 1067053 | 4 | < 0.1% |
| 1067000 | 1 | < 0.1% |
| Distinct | 50133 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 207757.3739 |
|---|---|
| Minimum | 121131 |
| Maximum | 271820 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 121131 |
|---|---|
| 5-th percentile | 157683 |
| Q1 | 185117.5 |
| median | 206689 |
| Q3 | 235517 |
| 95-th percentile | 255040 |
| Maximum | 271820 |
| Range | 150689 |
| Interquartile range (IQR) | 50399.5 |
Descriptive statistics
| Standard deviation | 30289.58164 |
|---|---|
| Coefficient of variation (CV) | 0.1457930521 |
| Kurtosis | -0.8934401953 |
| Mean | 207757.3739 |
| Median Absolute Deviation (MAD) | 24288 |
| Skewness | -0.03340722637 |
| Sum | 8.588939146 × 1010 |
| Variance | 917458755.7 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 212676 | 944 | 0.2% |
| 222871 | 527 | 0.1% |
| 206689 | 411 | 0.1% |
| 213390 | 410 | 0.1% |
| 192557 | 399 | 0.1% |
| 249742 | 378 | 0.1% |
| 244511 | 328 | 0.1% |
| 183798 | 313 | 0.1% |
| 209365 | 304 | 0.1% |
| 215043 | 301 | 0.1% |
| Other values (50123) | 409097 |
| Value | Count | Frequency (%) |
| 121131 | 2 | |
| 121508 | 2 | |
| 121611 | 4 | |
| 121674 | 4 | |
| 121736 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 271820 | 11 | |
| 271730 | 1 | < 0.1% |
| 271551 | 1 | < 0.1% |
| 271424 | 1 | < 0.1% |
| 271304 | 7 |
| Distinct | 67399 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.73687867 |
|---|---|
| Minimum | 40.49890536 |
| Maximum | 40.9127234 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 40.49890536 |
|---|---|
| 5-th percentile | 40.59944719 |
| Q1 | 40.67474383 |
| median | 40.73394232 |
| Q3 | 40.81310735 |
| 95-th percentile | 40.86664779 |
| Maximum | 40.9127234 |
| Range | 0.413818033 |
| Interquartile range (IQR) | 0.138363516 |
Descriptive statistics
| Standard deviation | 0.08314194231 |
|---|---|
| Coefficient of variation (CV) | 0.00204095024 |
| Kurtosis | -0.8934189037 |
| Mean | 40.73687867 |
| Median Absolute Deviation (MAD) | 0.0666757345 |
| Skewness | -0.03363583397 |
| Sum | 16841114.48 |
| Variance | 0.006912582571 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.75043077 | 937 | 0.2% |
| 40.77841252 | 526 | 0.1% |
| 40.73392684 | 411 | 0.1% |
| 40.75238792 | 405 | 0.1% |
| 40.69519898 | 396 | 0.1% |
| 40.85214119 | 378 | 0.1% |
| 40.83778162 | 317 | 0.1% |
| 40.67110691 | 304 | 0.1% |
| 40.74134137 | 301 | 0.1% |
| 40.6517009 | 300 | 0.1% |
| Other values (67389) | 409137 |
| Value | Count | Frequency (%) |
| 40.49890536 | 2 | |
| 40.49994754 | 2 | |
| 40.50021598 | 4 | |
| 40.50039077 | 4 | |
| 40.50056279 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 40.9127234 | 11 | |
| 40.91247643 | 1 | < 0.1% |
| 40.91198211 | 1 | < 0.1% |
| 40.91163433 | 1 | < 0.1% |
| 40.91130746 | 7 |
| Distinct | 67400 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.9223374 |
|---|---|
| Minimum | -74.25474319 |
| Maximum | -73.70072029 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -74.25474319 |
|---|---|
| 5-th percentile | -74.01723611 |
| Q1 | -73.96750176 |
| median | -73.92500172 |
| Q3 | -73.88039168 |
| 95-th percentile | -73.7854175 |
| Maximum | -73.70072029 |
| Range | 0.554022895 |
| Interquartile range (IQR) | 0.087110078 |
Descriptive statistics
| Standard deviation | 0.07667732122 |
|---|---|
| Coefficient of variation (CV) | -0.001037268624 |
| Kurtosis | 1.495141859 |
| Mean | -73.9223374 |
| Median Absolute Deviation (MAD) | 0.043568766 |
| Skewness | -0.2519878689 |
| Sum | -30560381.35 |
| Variance | 0.005879411589 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.98928218 | 937 | 0.2% |
| -73.98208877 | 526 | 0.1% |
| -73.8715824 | 411 | 0.1% |
| -73.97327466 | 405 | 0.1% |
| -73.95084903 | 396 | 0.1% |
| -73.92237572 | 378 | 0.1% |
| -73.91945797 | 317 | 0.1% |
| -73.88143296 | 304 | 0.1% |
| -73.97839261 | 301 | 0.1% |
| -73.86844675 | 300 | 0.1% |
| Other values (67390) | 409137 |
| Value | Count | Frequency (%) |
| -74.25474319 | 1 | |
| -74.254377 | 2 | |
| -74.25340303 | 1 | |
| -74.25325746 | 1 | |
| -74.25314827 | 1 |
| Value | Count | Frequency (%) |
| -73.70072029 | 10 | |
| -73.7009566 | 1 | < 0.1% |
| -73.70107442 | 1 | < 0.1% |
| -73.7011785 | 4 | < 0.1% |
| -73.70138638 | 1 | < 0.1% |
| Distinct | 67403 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| (40.75043076800005, -73.98928217599996) | 937 |
|---|---|
| (40.77841252300004, -73.98208876999998) | 526 |
| (40.73392684100002, -73.87158239799999) | 411 |
| (40.75238791700008, -73.97327466399997) | 405 |
| (40.69519897600002, -73.95084903199995) | 396 |
| Other values (67398) |
Length
| Max length | 40 |
|---|---|
| Median length | 39 |
| Mean length | 39.03702118 |
| Min length | 31 |
Characters and Unicode
| Total characters | 16138373 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 6 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20593 ? |
|---|---|
| Unique (%) | 5.0% |
Sample
| 1st row | (40.62576896100006, -73.99141682199996) |
|---|---|
| 2nd row | (40.67458330800008, -73.93022154099998) |
| 3rd row | (40.82310129900002, -73.86969046099993) |
| 4th row | (40.88745131300004, -73.84760778699997) |
| 5th row | (40.80022202900005, -73.93084834199995) |
| Value | Count | Frequency (%) |
| (40.75043076800005, -73.98928217599996) | 937 | 0.2% |
| (40.77841252300004, -73.98208876999998) | 526 | 0.1% |
| (40.73392684100002, -73.87158239799999) | 411 | 0.1% |
| (40.75238791700008, -73.97327466399997) | 405 | 0.1% |
| (40.69519897600002, -73.95084903199995) | 396 | 0.1% |
| (40.85214118700002, -73.92237572199997) | 378 | 0.1% |
| (40.83778161800007, -73.91945797099999) | 317 | 0.1% |
| (40.67110691100004, -73.88143295699997) | 304 | 0.1% |
| (40.74134137300007, -73.97839260899997) | 301 | 0.1% |
| (40.65170090400005, -73.86844675099996) | 300 | 0.1% |
| Other values (67393) | 409137 |
| Value | Count | Frequency (%) |
| 40.75043076800005 | 937 | 0.1% |
| 73.98928217599996 | 937 | 0.1% |
| 73.98208876999998 | 526 | 0.1% |
| 40.77841252300004 | 526 | 0.1% |
| 73.87158239799999 | 411 | < 0.1% |
| 40.73392684100002 | 411 | < 0.1% |
| 40.75238791700008 | 405 | < 0.1% |
| 73.97327466399997 | 405 | < 0.1% |
| 73.95084903199995 | 396 | < 0.1% |
| 40.69519897600002 | 396 | < 0.1% |
| Other values (134789) | 821474 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | |
| 4 | 1286633 | |
| 3 | 1140342 | |
| 8 | 1014050 | 6.3% |
| 6 | 935385 | 5.8% |
| 5 | 857928 | 5.3% |
| . | 826824 | 5.1% |
| 2 | 685119 | 4.2% |
| Other values (6) | 2735403 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13244489 | |
| Other Punctuation | 1240236 | 7.7% |
| Open Punctuation | 413412 | 2.6% |
| Space Separator | 413412 | 2.6% |
| Dash Punctuation | 413412 | 2.6% |
| Close Punctuation | 413412 | 2.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | |
| 4 | 1286633 | |
| 3 | 1140342 | |
| 8 | 1014050 | 7.7% |
| 6 | 935385 | 7.1% |
| 5 | 857928 | 6.5% |
| 2 | 685119 | 5.2% |
| 1 | 668343 | 5.0% |
| Value | Count | Frequency (%) |
| . | 826824 | |
| , | 413412 |
| Value | Count | Frequency (%) |
| ( | 413412 |
| Value | Count | Frequency (%) |
| 413412 |
| Value | Count | Frequency (%) |
| - | 413412 |
| Value | Count | Frequency (%) |
| ) | 413412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16138373 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | |
| 4 | 1286633 | |
| 3 | 1140342 | |
| 8 | 1014050 | 6.3% |
| 6 | 935385 | 5.8% |
| 5 | 857928 | 5.3% |
| . | 826824 | 5.1% |
| 2 | 685119 | 4.2% |
| Other values (6) | 2735403 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16138373 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | |
| 4 | 1286633 | |
| 3 | 1140342 | |
| 8 | 1014050 | 6.3% |
| 6 | 935385 | 5.8% |
| 5 | 857928 | 5.3% |
| . | 826824 | 5.1% |
| 2 | 685119 | 4.2% |
| Other values (6) | 2735403 |
| Distinct | 67403 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| POINT (-73.98928217599996 40.75043076800005) | 937 |
|---|---|
| POINT (-73.98208876999998 40.77841252300004) | 526 |
| POINT (-73.87158239799999 40.73392684100002) | 411 |
| POINT (-73.97327466399997 40.75238791700008) | 405 |
| POINT (-73.95084903199995 40.69519897600002) | 396 |
| Other values (67398) |
Length
| Max length | 45 |
|---|---|
| Median length | 44 |
| Mean length | 44.03702118 |
| Min length | 36 |
Characters and Unicode
| Total characters | 18205433 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20593 ? |
|---|---|
| Unique (%) | 5.0% |
Sample
| 1st row | POINT (-73.99141682199996 40.62576896100006) |
|---|---|
| 2nd row | POINT (-73.93022154099998 40.67458330800008) |
| 3rd row | POINT (-73.86969046099993 40.82310129900002) |
| 4th row | POINT (-73.84760778699997 40.88745131300004) |
| 5th row | POINT (-73.93084834199995 40.80022202900005) |
| Value | Count | Frequency (%) |
| POINT (-73.98928217599996 40.75043076800005) | 937 | 0.2% |
| POINT (-73.98208876999998 40.77841252300004) | 526 | 0.1% |
| POINT (-73.87158239799999 40.73392684100002) | 411 | 0.1% |
| POINT (-73.97327466399997 40.75238791700008) | 405 | 0.1% |
| POINT (-73.95084903199995 40.69519897600002) | 396 | 0.1% |
| POINT (-73.92237572199997 40.85214118700002) | 378 | 0.1% |
| POINT (-73.91945797099999 40.83778161800007) | 317 | 0.1% |
| POINT (-73.88143295699997 40.67110691100004) | 304 | 0.1% |
| POINT (-73.97839260899997 40.74134137300007) | 301 | 0.1% |
| POINT (-73.86844675099996 40.65170090400005) | 300 | 0.1% |
| Other values (67393) | 409137 |
| Value | Count | Frequency (%) |
| point | 413412 | |
| 40.75043076800005 | 937 | 0.1% |
| 73.98928217599996 | 937 | 0.1% |
| 73.98208876999998 | 526 | < 0.1% |
| 40.77841252300004 | 526 | < 0.1% |
| 40.73392684100002 | 411 | < 0.1% |
| 73.87158239799999 | 411 | < 0.1% |
| 73.97327466399997 | 405 | < 0.1% |
| 40.75238791700008 | 405 | < 0.1% |
| 73.95084903199995 | 396 | < 0.1% |
| Other values (134790) | 821870 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | 7.7% |
| 4 | 1286633 | 7.1% |
| 3 | 1140342 | 6.3% |
| 8 | 1014050 | 5.6% |
| 6 | 935385 | 5.1% |
| 5 | 857928 | 4.7% |
| 826824 | 4.5% | |
| . | 826824 | 4.5% |
| Other values (10) | 4660758 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13244489 | |
| Uppercase Letter | 2067060 | 11.4% |
| Space Separator | 826824 | 4.5% |
| Other Punctuation | 826824 | 4.5% |
| Open Punctuation | 413412 | 2.3% |
| Dash Punctuation | 413412 | 2.3% |
| Close Punctuation | 413412 | 2.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | |
| 4 | 1286633 | |
| 3 | 1140342 | |
| 8 | 1014050 | 7.7% |
| 6 | 935385 | 7.1% |
| 5 | 857928 | 6.5% |
| 2 | 685119 | 5.2% |
| 1 | 668343 | 5.0% |
| Value | Count | Frequency (%) |
| P | 413412 | |
| O | 413412 | |
| I | 413412 | |
| N | 413412 | |
| T | 413412 |
| Value | Count | Frequency (%) |
| 826824 |
| Value | Count | Frequency (%) |
| ( | 413412 |
| Value | Count | Frequency (%) |
| - | 413412 |
| Value | Count | Frequency (%) |
| . | 826824 |
| Value | Count | Frequency (%) |
| ) | 413412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16138373 | |
| Latin | 2067060 | 11.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | |
| 4 | 1286633 | |
| 3 | 1140342 | |
| 8 | 1014050 | 6.3% |
| 6 | 935385 | 5.8% |
| 5 | 857928 | 5.3% |
| 826824 | 5.1% | |
| . | 826824 | 5.1% |
| Other values (5) | 2593698 |
| Value | Count | Frequency (%) |
| P | 413412 | |
| O | 413412 | |
| I | 413412 | |
| N | 413412 | |
| T | 413412 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18205433 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 2757116 | |
| 9 | 2491983 | |
| 7 | 1407590 | 7.7% |
| 4 | 1286633 | 7.1% |
| 3 | 1140342 | 6.3% |
| 8 | 1014050 | 5.6% |
| 6 | 935385 | 5.1% |
| 5 | 857928 | 4.7% |
| 826824 | 4.5% | |
| . | 826824 | 4.5% |
| Other values (10) | 4660758 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CMPLNT_NUM | ADDR_PCT_CD | BORO_NM | CMPLNT_FR_DT | CMPLNT_FR_TM | CMPLNT_TO_DT | CMPLNT_TO_TM | CRM_ATPT_CPTD_CD | HADEVELOPT | HOUSING_PSA | JURISDICTION_CODE | JURIS_DESC | KY_CD | LAW_CAT_CD | LOC_OF_OCCUR_DESC | OFNS_DESC | PARKS_NM | PATROL_BORO | PD_CD | PD_DESC | PREM_TYP_DESC | RPT_DT | STATION_NAME | SUSP_AGE_GROUP | SUSP_RACE | SUSP_SEX | TRANSIT_DISTRICT | VIC_AGE_GROUP | VIC_RACE | VIC_SEX | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | Lat_Lon | New Georeferenced Column | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 885776788 | 66 | NaN | 12/23/2020 | 19:50:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | OUTSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 12/23/2020 | NaN | NaN | NaN | NaN | NaN | 18-24 | BLACK | M | 986633 | 167258 | 40.625769 | -73.991417 | (40.62576896100006, -73.99141682199996) | POINT (-73.99141682199996 40.62576896100006) |
| 1 | 350637195 | 77 | NaN | 12/21/2020 | 01:10:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | INSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 12/21/2020 | NaN | NaN | NaN | NaN | NaN | 25-44 | BLACK | M | 1003606 | 185050 | 40.674583 | -73.930222 | (40.67458330800008, -73.93022154099998) | POINT (-73.93022154099998 40.67458330800008) |
| 2 | 347843168 | 43 | BRONX | 11/22/2020 | 22:00:00 | NaN | NaN | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 104 | FELONY | NaN | RAPE | NaN | PATROL BORO BRONX | 157.0 | RAPE 1 | STREET | 11/23/2020 | NaN | UNKNOWN | UNKNOWN | U | NaN | 25-44 | BLACK | F | 1020316 | 239179 | 40.823101 | -73.869690 | (40.82310129900002, -73.86969046099993) | POINT (-73.86969046099993 40.82310129900002) |
| 3 | 197941396 | 47 | NaN | 11/22/2020 | 09:50:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | INSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 11/22/2020 | NaN | 25-44 | BLACK | M | NaN | 25-44 | BLACK | F | 1026387 | 262634 | 40.887451 | -73.847608 | (40.88745131300004, -73.84760778699997) | POINT (-73.84760778699997 40.88745131300004) |
| 4 | 298404927 | 25 | NaN | 11/21/2020 | 15:38:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. HOUSING POLICE | 101 | FELONY | OUTSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 11/21/2020 | NaN | NaN | NaN | NaN | NaN | 18-24 | BLACK HISPANIC | M | 1003396 | 230824 | 40.800222 | -73.930848 | (40.80022202900005, -73.93084834199995) | POINT (-73.93084834199995 40.80022202900005) |
| 5 | 549342890 | 44 | NaN | 11/05/2020 | 09:40:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | OUTSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 11/05/2020 | NaN | NaN | NaN | NaN | NaN | 18-24 | WHITE HISPANIC | M | 1006434 | 244344 | 40.837324 | -73.919831 | (40.83732351100008, -73.91983075699994) | POINT (-73.91983075699994 40.83732351100008) |
| 6 | 921351410 | 28 | NaN | 11/04/2020 | 09:14:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | OUTSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 11/04/2020 | NaN | 25-44 | BLACK | M | NaN | 25-44 | BLACK | M | 997670 | 230545 | 40.799467 | -73.951531 | (40.799466801000044, -73.95153053599995) | POINT (-73.95153053599995 40.799466801000044) |
| 7 | 452350235 | 44 | NaN | 11/02/2020 | 18:30:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | INSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 11/02/2020 | NaN | <18 | WHITE HISPANIC | M | NaN | 18-24 | BLACK | F | 1006999 | 245897 | 40.841585 | -73.917784 | (40.841584606000026, -73.91778363799993) | POINT (-73.91778363799993 40.841584606000026) |
| 8 | 714801710 | 110 | NaN | 11/01/2020 | 01:20:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | OUTSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 11/01/2020 | NaN | NaN | NaN | NaN | NaN | 25-44 | WHITE HISPANIC | M | 1016573 | 210045 | 40.743151 | -73.883355 | (40.74315076400006, -73.88335454299995) | POINT (-73.88335454299995 40.74315076400006) |
| 9 | 985956017 | 63 | NaN | 10/31/2020 | 01:50:00 | NaN | NaN | COMPLETED | NaN | NaN | NaN | N.Y. POLICE DEPT | 101 | FELONY | OUTSIDE | MURDER & NON-NEGL. MANSLAUGHTER | NaN | NaN | NaN | NaN | NaN | 10/31/2020 | NaN | NaN | NaN | NaN | NaN | 25-44 | BLACK | M | 1005075 | 169926 | 40.633068 | -73.924972 | (40.63306790400002, -73.92497238099996) | POINT (-73.92497238099996 40.63306790400002) |
Last rows
| CMPLNT_NUM | ADDR_PCT_CD | BORO_NM | CMPLNT_FR_DT | CMPLNT_FR_TM | CMPLNT_TO_DT | CMPLNT_TO_TM | CRM_ATPT_CPTD_CD | HADEVELOPT | HOUSING_PSA | JURISDICTION_CODE | JURIS_DESC | KY_CD | LAW_CAT_CD | LOC_OF_OCCUR_DESC | OFNS_DESC | PARKS_NM | PATROL_BORO | PD_CD | PD_DESC | PREM_TYP_DESC | RPT_DT | STATION_NAME | SUSP_AGE_GROUP | SUSP_RACE | SUSP_SEX | TRANSIT_DISTRICT | VIC_AGE_GROUP | VIC_RACE | VIC_SEX | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | Lat_Lon | New Georeferenced Column | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 413402 | 296310647 | 70 | BROOKLYN | 01/02/2020 | 18:00:00 | 01/02/2020 | 18:10:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 351 | MISDEMEANOR | INSIDE | CRIMINAL MISCHIEF & RELATED OF | NaN | PATROL BORO BKLYN SOUTH | 259.0 | CRIMINAL MISCHIEF,UNCLASSIFIED 4 | RESIDENCE - APT. HOUSE | 01/02/2020 | NaN | UNKNOWN | BLACK HISPANIC | U | NaN | 45-64 | BLACK | F | 996193 | 176571 | 40.651323 | -73.956961 | (40.65132343000005, -73.95696100099997) | POINT (-73.95696100099997 40.65132343000005) |
| 413403 | 177612365 | 30 | MANHATTAN | 01/01/2020 | 00:30:00 | 01/01/2020 | 01:00:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 106 | FELONY | NaN | FELONY ASSAULT | NaN | PATROL BORO MAN NORTH | 109.0 | ASSAULT 2,1,UNCLASSIFIED | STREET | 01/04/2020 | NaN | 25-44 | BLACK | M | NaN | 25-44 | BLACK | M | 997701 | 239900 | 40.825144 | -73.951400 | (40.825143625000074, -73.95139982299997) | POINT (-73.95139982299997 40.825143625000074) |
| 413404 | 710352058 | 106 | QUEENS | 12/16/2019 | 09:00:00 | 12/17/2019 | 17:00:00 | ATTEMPTED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 109 | FELONY | INSIDE | GRAND LARCENY | NaN | PATROL BORO QUEENS SOUTH | 430.0 | LARCENY,GRAND BY BANK ACCT COMPROMISE-UNCLASSIFIED | RESIDENCE-HOUSE | 01/03/2020 | NaN | NaN | NaN | NaN | NaN | 45-64 | WHITE | M | 1027072 | 185804 | 40.676570 | -73.845620 | (40.67657045900006, -73.84562010099995) | POINT (-73.84562010099995 40.67657045900006) |
| 413405 | 718164907 | 49 | BRONX | 01/05/2020 | 11:45:00 | 01/05/2020 | 11:57:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 351 | MISDEMEANOR | INSIDE | CRIMINAL MISCHIEF & RELATED OF | NaN | PATROL BORO BRONX | 259.0 | CRIMINAL MISCHIEF,UNCLASSIFIED 4 | RESIDENCE-HOUSE | 01/05/2020 | NaN | 25-44 | WHITE | M | NaN | 25-44 | WHITE HISPANIC | F | 1023379 | 249358 | 40.851027 | -73.858564 | (40.85102663600002, -73.85856409399997) | POINT (-73.85856409399997 40.85102663600002) |
| 413406 | 339790489 | 14 | MANHATTAN | 01/04/2020 | 21:43:00 | 01/04/2020 | 21:44:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 341 | MISDEMEANOR | INSIDE | PETIT LARCENY | NaN | PATROL BORO MAN SOUTH | 333.0 | LARCENY,PETIT FROM STORE-SHOPL | CLOTHING/BOUTIQUE | 01/05/2020 | NaN | UNKNOWN | BLACK | M | NaN | UNKNOWN | UNKNOWN | D | 987873 | 212315 | 40.749440 | -73.986926 | (40.74943967000007, -73.98692557399994) | POINT (-73.98692557399994 40.74943967000007) |
| 413407 | 947490808 | 13 | MANHATTAN | 01/04/2020 | 18:25:00 | 01/04/2020 | 18:28:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 341 | MISDEMEANOR | INSIDE | PETIT LARCENY | NaN | PATROL BORO MAN SOUTH | 333.0 | LARCENY,PETIT FROM STORE-SHOPL | DEPARTMENT STORE | 01/04/2020 | NaN | UNKNOWN | WHITE HISPANIC | M | NaN | UNKNOWN | UNKNOWN | D | 990238 | 209365 | 40.741341 | -73.978393 | (40.74134137300007, -73.97839260899997) | POINT (-73.97839260899997 40.74134137300007) |
| 413408 | 913801459 | 102 | QUEENS | 01/02/2020 | 20:30:00 | 01/03/2020 | 06:30:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 121 | FELONY | FRONT OF | CRIMINAL MISCHIEF & RELATED OF | NaN | PATROL BORO QUEENS SOUTH | 269.0 | MISCHIEF,CRIMINAL, UNCL 2ND | RESIDENCE-HOUSE | 01/03/2020 | NaN | UNKNOWN | UNKNOWN | U | NaN | 45-64 | ASIAN / PACIFIC ISLANDER | M | 1032404 | 190239 | 40.688716 | -73.826366 | (40.68871610400004, -73.82636559499997) | POINT (-73.82636559499997 40.68871610400004) |
| 413409 | 927013283 | 24 | MANHATTAN | 01/02/2020 | 09:32:00 | 01/02/2020 | 09:36:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 107 | FELONY | INSIDE | BURGLARY | NaN | PATROL BORO MAN NORTH | 211.0 | BURGLARY,COMMERCIAL,DAY | CHAIN STORE | 01/02/2020 | NaN | 25-44 | BLACK | M | NaN | UNKNOWN | UNKNOWN | D | 991075 | 227074 | 40.789947 | -73.975354 | (40.78994739900003, -73.97535415699997) | POINT (-73.97535415699997 40.78994739900003) |
| 413410 | 844073735 | 50 | BRONX | 01/05/2020 | 12:55:00 | 01/05/2020 | 13:07:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 347 | MISDEMEANOR | NaN | INTOXICATED & IMPAIRED DRIVING | NaN | PATROL BORO BRONX | 905.0 | INTOXICATED DRIVING,ALCOHOL | STREET | 01/05/2020 | NaN | 45-64 | WHITE HISPANIC | M | NaN | UNKNOWN | UNKNOWN | E | 1014211 | 260753 | 40.882338 | -73.891652 | (40.88233829700005, -73.89165215599996) | POINT (-73.89165215599996 40.88233829700005) |
| 413411 | 871721952 | 115 | QUEENS | 01/01/2020 | 05:20:00 | 01/01/2020 | 05:25:00 | COMPLETED | NaN | NaN | 0.0 | N.Y. POLICE DEPT | 106 | FELONY | INSIDE | FELONY ASSAULT | NaN | PATROL BORO QUEENS NORTH | 109.0 | ASSAULT 2,1,UNCLASSIFIED | RESIDENCE-HOUSE | 01/01/2020 | NaN | 25-44 | WHITE HISPANIC | F | NaN | 25-44 | WHITE HISPANIC | M | 1019201 | 212389 | 40.749574 | -73.873858 | (40.74957445600006, -73.87385847499998) | POINT (-73.87385847499998 40.74957445600006) |